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ABSTRACT 



The speed of an input speech is changed without any change 
of the pitch of the input speech. Raw data of a speech are 
stored so that the speed of the speech can be modulated 
continuously on the basis of the raw data of the speech. In 
the speech speed conversion method, a speech speed con- 
version process for the input speech is carried out in a period 
designated when speech speed conversion is needed, which 
the speech speed conversion is not carried out in the other 
period. Further, in the speech speed conversion apparatus 
having a unit for inputting a speech, a speech speed con- 
version unit far changing the speed of the input speech, and 
a unit for supplying the output of the speech speed conver- 
sion unit as an output speech to listener's ears, the apparatus 
further includes a speech speed conversion switch, and a unit 
for outputting a speech while changing the speech speed of 
the input speech in a period in which the speech speed 
conversion switch is turned on, but for outputting a speech 
without any change of the input speech in the other period 
in which the speech speed conversion switch is turned off. 

34 Claims, 28 Drawing Sheets 
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AUDIO SIGNAL STORING APPARATUS SUMMARY OF THE INVENTION 

HAVING A FUNCTION FOR CONVERTING ^ of ^ invention is to provide a tech- 

SPEECII SPEED nique for reproducing speech while converting the speed of 

the speech if needed. 

This appUcation is a C^ontinuation-In-P^ rf applfcation 5 of ^ t . g ^ ^ a 

Sen No. 07/931,375, filed on Aug. 18, 1992, abandoned. technique in which raw data of a speech are stored so that the 

BACKGROUND OF THE INVENTION speed of the speech can be converted continuously on the 

basis of the raw data of th ejjpeech. 

The present invention relates to a speech speed conver- yfj^ ^ of t ^ SCT t invention is to provide an 

sion mettiod, a speech speed conversion apparatus, and an „ £ ^ Jxa ^ al unit for using a speech speed conversi on 

electronic apparatus for modulating the speed of a voice, j n conversation, or the HkeT 

and, particularly, to a technique which is effective on appli- -» ^r7 ™. , :"™' w t . „ . „ .. 

catioE to a conteol technique for using such an apparatuTui t A ° f f *• P^"* invention is to povide a 

conversation and so on. technique capable of widening the range of application of 



Conventional devices for aiding hearing for persons hard 



the speech speed conversion apparatus. 



of hearingmainly have used analog type hearing aids which * ^ °/ *? vmaA mvcnti ™ is t0 ^ 1 
use analog circuits for processing mVampHtude and ft* tech ^ c capable of usinga memory of the speech speed 
quency characteristic of a voice. On the contrary, research conversion apparatus effectively. 

and development for using digital signal processing to A further object of the present invention is to provide a 
compensate for hearing-impairment have been made eagerly 20 technique capable of returning a reading pointer of the 
in recent years. The trend of this research and development memory of the speech speed conversion apparatus, 
has been described in detail, for example, in "Application of A further object of the present invention is to provide a 
Digital Technique to Compensation for Hearing- technique capable of controlling an AV apparatus connected 
Impairment", Journal of Acoustical Society of Japan (VoL to the speech speed conversion apparatus. 
47, No. 10, pp. 760-765, 1991) and "Speech-perception aids 25 A further object of the present invention is to provide a 
for hearing-impaired people: Current status and needed technique capable of performing continuous speech speed 
research", J. Acoust Soc. Am (90(2), Pt 1, Aug. 1991). conversion in the speech speed conversion apparatus. 

To compensate for hearing loss, amplification of the A further object of the present invention is to provide a 
amplitude of a speech signal and compression of dynamic 3Q technique capable of attaining improvement in handling 
range are generally performed with every frequency in property of the speech speed conversion apparatus, 
accordance with the hearing characteristic of a user. In the a further object of the present invention is to provide a 
conventional analog hearing aid, such a process is realized technique capable of reducing electric power consumed by 
by an analog circuit. On the other hand, in the digital hearing th e speech speed conversion apparatus, 
aid developed in recent years, this process is realized by a 35 ^ f oxego i n g and other objects and novel features of the 
software such as a digital filter, or the like, so that adaptation foye^on will become clear from the description in 

to the hearing characteristic of the user can be made more in specification and the accompanying drawings. 

^ eta ^* A summary of typical embodiments of the present inven- 

In the aforementioned trend, an attempt to change only the tion disclosed in this application will be described in brief as 
speed of a voice by digital signal processing without any ^ follows. 

change of the pitch of the voice to thereby perform hearing (1) A speech speed conversion method for receiving an input 
aids of higher degree covering the whole range of a hearing speech and changing the speed of the input speech with- 
systeminclusive of the decline of language processing speed ^ changing the pitch of the input speech comprises the 
has been made in recent years. Such a speech speed con- ^eps of: carrying out a speech speed conversion process 
version technique has been described in detail, for example, 45 for me ^put speech in a period which is designated by a 
in 'Development of Portable DSP System for performing listener w hen speech speed conversion is needed; and 
Speech Processing for the Aged", Technical Research carrying out no speech speed conversion process in the 
Report of Institute of Electronics, Information and Commu- period than the designated period, 

nication Engineers of Japan (Vol. 92, No. 207 SP92-54) and ( 2 ) A speech speed conversion apparatus comprises: means 
"High-Quality Real-Time Speech Speed Conversion ^ for xeC civing an input speech; a speech speed conversion 
System", ditto (SP92-55). means for changing the speed of the input speech; means 

In the aforementioned conventional techniques, a broad- for supplying an output of the speech speed conversion 
casted voice over the television/radio or the like, or a voice means as a speech output to a listener's ears; a speech 
recorded in a tape recorder or the like, was used as the voice speed conversion switch apparatus; and means adapted to 
to be subjected to speech speed conversion. That is, the 55 output a speech while changing the speech speed of the 
subject of speech speed conversion was only a voice one- input speech in a period in which the speech speed 
sidedly given to a listener conversion switch is turned on, and to output an output 

Taking into account the fact that the conventional hearing speech without any change of the speech speed of the 
aids can be used without distinction of the input voice in input speech in the other period in which the speech speed 
kind, it is however preferable that the speech speed conver- 60 conversion switch is turned off. 

sion apparatus also can use other voices than the afaremen- (3) A speech speed conversion method for encoding and 
tioned voices as the input voice. Particularly, if the voice of accumulating a raw speech, reading the accumulated 
a talker in conversation can be heard slowly, the apparatus encoded speech and changing the speed of the speech 
can be used not only in the case of hearing perception aids without any change of the pitch of the raw speech, which 
for aged or hearing-impaired people but also in the case of 65 comprises the steps of: carrying out a speech speed 
hearing aids in conversation of a foreign language unf amfl- conversion process for an input speech in a period which 
iar to hearing-unhandicapped people, and so on. is designated when speech speed conversion is needed; 
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and carrying out no speech speed conversion process in 
the other period than the designated period. 

(4) A speech speed conversion apparatus which comprises: 
means for receiving a raw speech as an input speech; a 
memory means for encoding and accumulating the input 
speech; a speech speed conversion means for reading the 
accumulated encoded-speech and for changing the speed 
of the input speech; means for supplying an output of the 
speech speed conversion means as a speech output to a 
listener's ears; a speech speed conversion switch; and 
means adapted to output a speech while changing the 
speech speed of the input speech in a period in which the 
speech speed conversion switch is turned on, and to 
output an output speech without any change of the speech 
speed of the input speech in the other period in Which the 
speech speed conversion switch is turned off. 

(5) In the speech speed conversion apparatus, the memory 
means includes means for storing data by frame. 

(6) The speech speed conversion apparatus further com- 
prises means for determining waveform expansion/ 
reduction processes in the speech speed conversion pro- 
cess on the basis of a comparison between power of a 
frame and a threshold provided as a variable. 

(7) The speech speed conversion apparatus further com- 
prises a speech speed selection switch for selecting the 
speed of the speech, and means for changing the speed of 
the speech to the speech speed selected by the speech 
speed selection switch. 

( 8^The speed conversion apparatus further comprises means 
*jgv crmtmO frtr ^ Titmllfnff an audio/video appara tus^ 

(§)The speed conversion apparatus further comprises a 
repeat switch, and means for repeating a reproduced 
speech in a period in which the repeat switch is turned on. 

(10) In the speech speed conversion apparatus, the repeat 
means includes at least one of means for turning back the 
speech by several seconds whenever the repeat switch is 
pushed once, means for sometimes generating intermit- 
tent sounds while the speech is turned back, means for 
stopping the turning-back of the speech when the speech 
reaches an end of a ring buffer, and means for selecting the 
speech speed at the repeat time. 

(11) In the speech speed conversion apparatus, the means for 
selecting the speech speed at the repeat time has at least 
two modes of default speed value repeat, slow repeat, fast 
repeat and gradually accelerated repeat. 

(12) The speech speed conversion apparatus further com- 
prises a catching-up means far adjusting the quantity of a 
lag from the real time in a period in which the stored 
information is reproduced in the case where the lag from 
the real time is caused by a speech speed conversion or 
repeat operation. 

(13) In the speech speed conversion apparatus, the catching- 
up means includes at least one of means for starting 
catching-up when a slow reproduction mode is 
terminated, means for starting catching-up when repro- 
duction is turned back to the point of time of the start of 
a repeat after the repeat, means for selecting the speech 
speed at the catching-up time, means for automatically 
shifting the current mode to a through mode for directly 
outputting the input speech when catching-up is 
completed, and means for generating a report signal 
sound (message) when catching-up is completed. 

(14) In the speech speed conversion apparatus, the means for 
selecting the speech speed at the catching-up time has at 
least one of means for Tnairing a nonstop skip to the real 
time, means for catching up the real time with fast 
hearing, and means for making a parallel movement with 
a time lag. 
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(15) The speech speed conversion apparatus further com- 
prises at least one of the speech speed conversion switch, 
a speech speed selection switch, a repeat switch, and a 
reset switch which are provided in a peripheral portion on 
a side surface of the speech speed conversion apparatus so 
as to facilitate handling. 

(16) In the speech speed conversion apparatus, the reset 
switch includes means for stopping the repeating or 
catching-up operation and making a skip to the real time 
when the switch is turned on at the repeat or catching-up 
time, and then shifting the current mode to a through 
mode. ,. 

(17) In the speech speed conversion apparatus, the speech 
speed conversion means is provided as a software 
executed by a digital signal processor having an input 
terminal for receiving an interruption request signal from 
the outside, so that controlling of the speech speed con- 
version process or switching of the rate of speech speed 
conversion on the basis of the speech speed conversion 
switch is given to the digital signal processor via the 

■ interru ption request signal input terminai ^ 
(lS) The speech speed conversion apparatus further com- 
prises means for hearing the output speech through a 
binaural headphone. 

( 19) A speech speed conversion apparatus which comprises: 
a microphone for converting an acoustic signal into an 
electric signal; an analog amplifier for amplifying an 
output of the microphone; a low-pass filter for removing 
high-frequency components from the output of the analog 
amplifier; an A/D converter for converti ng.an_outoat,of 
the low-pass rdter/WrHcrllsTan analog signal, into a digital 
sijinal; a digital signal processor for carrying out digital 
sigEaT processing to execute a speech speed changing 
process; a memory means for holding input speech data 
and data obtained as a result of signal processing; means 
for controlling the speech speed changing process 
executed by the digital signal processor; means for chang- 
ing a processing parameter; a D/A converter for convert - 
ing digital speech data into an analog value; a s econd 
low-pass filter tor removing high-frequency components 
from an output of the D/A converter; a second analog 
amplifier for amplifying an output of the second low-pass 
filter; and a headphone for converting an output of the 
second analog amplifier into an acoustic signal and sup- 
plying the acoustic signal to both ears. 

(20) A speech speed conversion apparatus which comprises: 
a microphone for converting an acoustic signal into an 
electric signal; an analog amplifier far amplifying an 
output of the microphone; a low-pass filter for removing 
high-frequency components from an output of the analog 
amplifier; an A/D converter for converting an output of 
the low-pass filter, which is an analog signal, into a digital 
signal; a memory means for holding input speech data and 
data obtained as a result of signal processing; a digital 
signal processor for reading accumulated information and 
carrying out digital signal processing to execute a speech 
speed changing process; means for controlling the speech 
speed changing process executed by the digital signal 
processor, means for changing a processing parameter; a 
D/A converter for converting digital speech data into an 
analog value; a second low-pass filter for removing high- 
frequency components from an output of the D/A con- 
verter; a second analog amplifier for amplifying an output 
of the second low-pass filter, and a headphone for con- 
verting an output of the second analog amplifier into an 
acoustic signal and supplying the acoustic signal to both 
ears. 
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(21) In the speech speed conversion apparatus, die speech intermediate stage, and an electric source supply means 
speed conversion means carries out a series of procedure operated in an analog through mode in which analog 
over a whole frame repeatedly through a pipeline process input-output systems are shcrfcdrcuited so as to be 
by frame with use of a plurality of input frame buffers, the directly connected to each other to thereby stop electric 
series of procedure including: applying a pitch extraction 5 source supply to a digital processing system between the 
process to a leading portion of the frame to detect the analog inputroutput systems when the switch is adjusted 
pitch of the leading portion; transferring data of the length to the intermediate stage. 

of one pitch thus detected to output buffers; multiplying (31) A telephone in which the speech speed conversion 
data of the length of two pitches by a window function means as defined in any one of the above paragraphs (2), 
which changes from 0 to 1 and by a window function 10 (4) and (30) is provided between a handset of the tele- 
which changes from 1 to 0; adding up respective data phone and a body of the telephone, 
obtained by the multiplications by the window functions (32) A telephone line switching system in which the speech 
to thereby generate a reproduced wave pattern having a speed conversion means as defined in any one of the 
time length of two pitches ; inserting the reproduced wave above paragraphs (2), (4) and (30) is provided in a 
pattern in the rear of the pre limin arily transferred data of 15 telephone line switching system, 
the length of one pitch; carrying out a pitch detection According to the feature described in the above para- 
process again while spearheaded by a position at a dis- graphs ( 1) and (2), in the case where a speech is inputted and 
tance of two pitches from the position preliminarily me speed of the speech is changed without any change of the 
subjected to the pitch extraction process to thereby per- pitch of the input speech, a speech speed conversion process 
form pitch detection at the position; and transferring data 20 is carried out for the input speech in a period which is 
of the length of n pitches (n is an integer) based on the designated when speech speed conversion is needed, but no 
pitch length obtained by the final pitch detection to the speech speed conversion is carried out in the other period, 
output buffers. Accordingly, the speech speed conversion apparatus can be 

(22) A speech speed conversion apparatus wherein the used not only for a voice such as a radio voice one-sidedly 
speech speed conversion means is executed only in the 25 given to a listener but also in the situation of conversation, 
case where average power of data in an input frame is so that a voice to be subjected to speech speed conversion 
higher than a preliminarily set threshold, which data can be selected by the listener without any disturbance of 
contained in the frame are directly transferred to the listener's own speech. 

output buffers in the case where the average power is Further, in a hearing aid, a foreign language learning 

lower than the threshold. 30 machine, a telephone, or the like, talker's voice can be heard 

(23) Ih Jne^speech speed conversion apparatus, a second at a slow speech speed without any change of the charac- 
t hreshold is p rovided in the threshold process for the teristic of the talker's voice. 

average po wer ot dW uT The input fra me~^thaT wgen a According to the feature described in the above para- 

frame having lower average power thanTKeT seco nd graphs (3) and (4), in the case where a raw speech encoded 

threshold is continued for a longer time than ajjeflmi- 35 and accumulated is read to change the speed of the speech 

riarily set time threshold, d ata in the frame havingj ower without any change of the pitch of the raw speech, or the 

Average power than the second threshold and continued like, a speech speed conversion process is carried out for the 

for a longer time than the time threshold are forbidden to input speech in a period which is designated when speech 

be transferred to the output buffers. speed conversion is needed, but no speech speed conversion 

CU\ Inthejpeech speed conversion app aratus, th e switc h or 40 is carried out in the other period. Accordingly, in addition© 

eacETbT the switches is c onstituted by a switchwE j^has the effect provided by the paragraphs (1) and (2), there can 

a feeling of soft touch so that t he microphone tloesjopt be provided effective use of the memory, a raw speech repeat 

pi ck Up dlclmolSe oftRe^switcfir function, a voice memory function, a repeat speech speed 

(25j"Ia the speech speed conversion apparatus, tihe switch es conversion function, a fast-hearing reproduction function, 

have respective surface forms different intacaJitysoas to 43 and so on. 

be identified without seeing. According to the feature described in the above paragraph 
{26 pVhc speech speed cm versioXapparatus further com- (5), data are stored by frame, so that writing^eading em- 
prises a rustling prevention means for changing a distance ciency can be improved. 

between the microphone and an apparatus body so that the According to the feature described in the above paragraph 
microphone does not touch clothes directly when the 50 (6), the determination of waveform expansion/reduction 

apparatus body is put into a breast pocket in use. process, silent-part elimination process, etc. in the speech 

(27) The speech speed conversion apparatus further com- speed conversion process is performed based on comparison 
prises a display means which is provided at a predeter- between power of a frame and a threshold, and the threshold 
mined position of the speech speed conversion apparatus is changed in accordance with the loudness of the input 
so that a quantity of a time lag from the real time can be 55 speech. Accordingly, the speech speed conversion process 
indicated visually. 0311 be carried out in accordance with the environmental 

(28) In the speech speed conversion apparatus, a ring buffer condition in use. 

is used as the memory means, and the apparatus further According to the feature described in the above paragraph 

comprises means far managing a lag time by a counter (7), in the speech speed conversion apparatus, there are 
indicating a time lag on the ring buffer. 60 provided a speech speed selection switch for selecting the 

(29) In the speech speed conversion apparatus, a standby speed of the speech, and means for changing the speed of the 
mode for lowering the clock cycle of the processor and speech to the speech speed selected by the speech speed 
carrying out the same process as in the through mode is selection switch. Accordingly, the speed of the speech to be 
provided besides the through mode. heard can be selected by the listener's own will. 

(30) The speech speed conversion apparatus further com- 65 f According to the feature described in the above paragraph 
prises an electric source switch operated at three stages I (8), means (AV control) for controlling an audio/video 
consisting of an ON stage, an OFF stage and an ON-OFF \apparatus is provided in the speech speed conversion appa- 
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ratus. Accordingly, a series of operation in which a signal for frame is higher than a preliminarily set ^f^ 1 ^^ 11 ^ 

pausing the reproducing operation of the external apparatus contained in the frame are directly transferred to the output 

isissufd to temporarily st^ the inputting of the speech to buffers in the case where the average power is lower than the 

the speech peed convex^ a sec0 nd threshold is provided in the threshold 

capacity* I«ffla»t and "T^^^^^ p^^m^v^^^^^^ 

pause signal is stopped to re-start the inputting of the speech £ having lower average power than the 

from the external apparatus when there is some free area in is ^^ed for a longer time than a 

the memory, is repeated irrespective of the expansion/ p^^^y set ^ threshold, data in the frame having 

reduction rate in the speech speed conversion. As a result, [ ower averagc power than the second threshold and contin- 

use of speech speed conversion can be continued for a long 10 ued f Qr a longCf ^ ^ ^ ^ threshold are forbidden 

time. to be transferred to the output buffers. 

According to the feature described in the above para- By ^ aforementioned configuration ofthe speech speed 

graphs (9) to (11), in the speech speed conversion apparatus, conversion means, there can be attained improvement in 

there are provided a repeat switch and means for repeating speech speed conversion efficiency and prevention of low- 

a reproduced speech in a period in which the repeat switch 15 efag Q f the quality of the reproduced speech, 

is turned on. Accordingly, the speech speed conversion of According to the feature described in the above paragraph 

the repeat speech can be carried out (24), the rrricrophone does not pick up click noise of each 

According to the feature described in the above para- switch, so that loud noise at the time of the manipulation of 

graphs ( 12) to (14), a catching-up means for catching up the the switch can be prevented. 

speech to a position of stored information to be heard is 20 According to the feature described in the above paragraph 

provided in the speech speed conversion apparatus. (25), the switches have respective surface formed different 

Accordingly, widening of the range of application of the in tactility so as to be identified without seeing, so that 

speech speed conversion apparatus, reduction in operating handling property can be improved, 

time, improvement in handling property, and so on, can be According to the feature described in the above paragraph 

attained 23 ( 2<5 )> mcrc ^ provided means for preventing the rustle of 

According to the feature described in the above para- clothes in contact with the microphone, so that entrance of 

graphs (15) and (16), at least one of the speech speed noise can be reduced. 

conversion switch, speech speed selection switch, repeat According to the feature described in the above paragraph 

switch and reset switch is provided in aperipheral portion on (27), a display means is provided at a predetermined position 

a side surface of the speech speed conversion apparatus so 30 ofthe speech speed conversion apparatus so that the quantity 

as to perform handling easily. Accordingly, widening of the of a time lag from the real time can be indicated visually, 

range of application of the speech speed conversion Accordingly, reduction in operating time, improvement in 

apparatus, reduction in operating time, improvement in handling property, and so on, can be attained, 

handling property, and so on, can be attained. According to the feature described in the above paragraph 

According to the feature described in the above para- 35 (2$), a ring buffer is used as the memory means, and mere 

graphs (17) to (23), the speech speed conversion means is is provided means for managing a lag time by a counter 

provided as a software executed by a digital signal processor indicating a time lag on the ring buffer. Accordingly, the 

having an input terminal for receiving an interruption repeat process, the catching-up process, and so on, can be 

request signal from the outside, so that controlling of the carried out easily. 

speech speed conversion process or switching of the speech 40 According to the feature described in the above paragraph 

speed conversion rate on the basis of the speech speed (29), a standby mode is provided besides the through mode, 

conversion switch is given to the digital signal processor via so that reduction in consumed electric power can be attained, 

the interruption request signal input terminal. According to the feature described in the above paragraph 

Further, the aforementioned speech speed conversion (30), there is provided an electric source switch operated in 

means carries out a series of procedure over a whole frame 45 three stages consisting of an ON stage, an OFF stage and an 

repeatedly through a pipe-line process by frame with use of ON-OFF intermediate stage so that an analog through mode 

a plurality of input frame buffers, the series of procedure is provided. Accordingly, reduction in electric power can be 

including: applying a pitch extraction process to a leading attained. 

portion of the frame to detect the pitch of the leading According to the feature described in the above paragraph 
portion; transferring data of the length of one pitch thus 50 (31), the speech speed conversion means is provided 
detected to output buffers; multiplying data of the length of between a handset of a telephone and a body of the tele- 
two pitches by a window function which changes from 0 to phone. Accordingly, a speech to be subjected to speech 
1 and by a window function which changes from 1 to 0; speed conversion can be selected by the listener wimout any 
adding up respective data obtained by the multiplications by disturbance of the listener' s own speech, 
the window functions to thereby generate a reproduced wave 55 Further, in the telephone, the voice can be heard at a slow 
pattern having a time length of two pitches; inserting the speech speed without any change of the characteristic of the 
reproduced wave pattern in the rear of the preliminarily talker's voice. 

transferred data of the length of one pitch; carrying out a According to the feature described in the above paragraph 

pitch detection process again while spearheaded by a posi- (32), the speech speed conversion means is provided in a 

tion at a distance of two pitches from the position prelimi- 60 telephone line switching system. Accordingly, the voice to 

narily subjected to the pitch extraction process to thereby be subjected to speech speed conversion can be selected by 

perform pitch detection at the position; and transferring data the listener without any disturbance of the listener's own 

of the length of n pitches (n is an integer) based on the pitch speech. 

length obtained by the final pitch detection to the output Sifl further advantages of the present invention will 

buffers. 65 become apparent to those of ordinary skill in the art upon 

Further, the speech speed conversion means is executed reading and understanding the following detailed descrip- 

onry in the case where average power of data in an input tion of the preferred and alternate embodiments. 



03/17/2004, EAST Version: 1.4.1 



5,717,818 

9 10 

BRIEF DESCRIPTION OF THE DRAWINGS FIG. 27 is a typical view far explaining an accelerator 

' . ^ . _ type switch used in the continuous speech speed conversion 

FIG. lis ablockdiagramshowingthe schematic structure »^ accordin t0 t 

rf ^" h f^' I ^r i v™ Ha2*isablc«kdiagra^ 

HG.2isagraphfeexp^ 5 speed inversion apparatus provided with 

proofs executed within a DSP according to present mvcn- ^ Ay C J^£ S to p^^vention; 

^FIG. 3 is a graph for explaining fee concept of a threshold ™». 29 is a view fa explaining the operation of the AV 

process accorS to presVnt invention; contto1 ***** accardin S 10 mvenUon ; „ . 

FIG. 4 is a view showing the form of use of the speech io FIG. 30 k a flow chart diowing the procedure of a main 

speed conversion apparatus according to present invention; P~ * ** speech speed conversion apparatus provided 

%~ _ , . TV, , . A * J\ rtff . with the AV control means according to present invention; 

FIG. 5 is a flow chart showing the control procedure of the V./" « u - 

speech speed conversion apparatus according to present HG 31 is a flow chart to ^ 

invention; of FIG. 30; 

FIG. 6 is a front plan viewed from the front of the speech u FIGS. 32A to 32C are views for explaining the arrange- 

speed conversion apparatus according to present invention; ™nt of the miaophone in the speech speed conversion 

FIG. 7 is a back plan viewed from the back of the speech apparatus according to present invents; 

speed conversion apparatus according to present invention; HG. 33 is a view showing the structure of a modified 

FIG. 8 is a top plan viewed from the top of the speech ^ cxam P le accordin 8 t0 P™** Mention; 

speed conversion apparatus according to present invention; 20 FIG. 34 is a view for explaining a lag time display means 

FIG. 9 is a left plan viewed from the left of the speech in me S»*h conversion apparatus according to 

speed conversion apparatus according to present invention; present invention; 

FiaiOisarightplanviewedfromtherightofthespeech FIG. 35 is a diagram for explaining an electric source 

speed conversion apparatus according to present invention; * device in the speech speed conversion apparatus according 

FIG. U is a block diagram showing the functional struc- t0 F osent invention; 

ture of the speech speed conversion apparatus according to FIG. 36 is a diagram for explaining an embodiment in 

present invention* which the speech speed conversion means according to the 

FIGS. 12A and 12B are typical graphs for explaining a P™«* invention is applied to a telephone; and 

co mpression proCeSS in a speech compression portion 30 FIG. 37 is a diagram for explaining an embodiment in 

accmdin g to present invention: ~ which the speech speed conversion means according to the 

^RSnTis a flow chart sho^Sglhe procedure of a main P^ent invention is applied to a premises broadcasting 

process according to present invention; system. 

FIG. 14 is a flow chart to be continued from the flow chart ^ DESCRIPTION OF THE PREFERRED 

of FIG. 13; EMBODIMENTS 

FIG. 15 is a state transition view typically showing 

transition between respective modes according to present Embodiments of the present invention are described 

invention* bclow in detail refercnce to me drawings. 

FIG. 16 is a flow chart showing the procedure of areading 40 ^ the drawings for explaining the embodiments parts 

pointer return routine according to present invention; having the same function are marked by the same reference 

FIG. 17 is a flow chart showing the procedure of a ^acter to omit the repeated description thereof, 

one-frame waveform expansion/reduction process accord. FIG 1 is a block diagram showing the schematic structure 

ing to present invention; <* Eternal circuits according to the present invention. The 

FIG 18 isaflowcharttobecontinuedfromtheflow chart 45 reference numeral 1 oesignates a DSP (Digtrt Sgrt 

f FIG 17- Processor); U, a software for performing a speech speed 

° «™ " . * ^ ^ v - *u a ~* conversion process; 12, a serial port; 13, a terminal for 

HG. 19 is a flow chart showing the procedure of a aM fl 14? a fl £ rcgis ter; 2, a memory 

parameter setting process according to present invention; (output buffer); 3? a elector switch; 4, a FTL (Push-Tb- 

FIG. 20 is a view for explaining the data compression Ustcn) converter; 6, a D/A converter; 7, 

process according to present invention; a i 0W .pas S filter; 8, a low-pass filter; 9, an analog amplifier, 

FIG. 21 is a view for explaining the data compression i Ut an analog amplifier; 321, a microphone; and 325, a 

process according to present invention; binaural headphone (earphone). 

FIG. 22 is a view for explaining the data compression ^ a speech speed conversion apparatus according to this 

process according to present invention; 35 embodiment, as shown in FIG. 1, a voice is inputted to the 

FIG. 23 is a flow chart showing the procedure of the total microphone 321 and outputted as a voice signal (an electric 

operation of the speech speed conversion apparatus provided signal). This voice signal is inputted via the amplifier 10 and 

with a continuous speech speed conversion means according the low-pass filter 7 to the A/D converter 5, in which the 

to present invention; voice signal is converted from an analog value into a digital 

FIG. 24 is a flow chart to be continued from the flow chart & value at intervals of a time set in advance, 

of FIG. 23; The voice signal converted into a digital value as 

FIG. 25 is a flow chart showing the procedure of the total described above is inputted to the DSP 1. Then, the speech 

operation of the speech speed conversion apparatus provided speed conversion process of the voice signal is realized by 

with a continuous speech speed conversion means according the software U on the DSP 1. The PTL switch 4 is connected 

to present invention; 65 to the external interruption flag terminal 13 contained in the 

FIG. 26 is a flow chart to be continued from the flow chart DSP 1, so that the state of the FTL switch 4 is expressed as 

of FIG. 25; a numerical value of the flag register 14 which is provided 
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in the inside of the DSP 1 so as to correspond to this terminal 
13. In the software 11 on the DSP 1, a judgment in 
accordance with the numerical value of the flag register 14 
is made as to whether the speech speed conversion process 
is to be performed or not to be performed. 5 

The digital voice data subjected to the speech speed 
conversion process is stored in the output buffer memory 2. 
The D/A converter 6 converts the data of the output buffer 
memory 2 from a digital value into an analog value at 
intervals of a time set in advance. The analog signal obtained 10 
by this conversion is inputted via the low-pass filter 8 to the 
analog amplifier 9 and outputted as a voice from the binaural 
headphone 325 in a listener's favorite amplitude of speech 
signal. 

In this embodiment, two kinds of switches are prepared l5 
for the PTL switch 4. One is a switch in which current 
conduction is made as long as a pushbutton is pushed. The 
other is a switch in which the current conduction state is 
maintained though the hold of the pushbutton is released. 
The former is used in the case of conversation whereas the ^ 
latter is used in the case of continuous speech speed con- 
version of a one-sidedly given voice such as a radio voice 
which is conventional utilization, and the like. Further in this 
embodiment, the selector switch 3 as well as the PTL switch 
4 is connected to the external interruption flag terminal 13 
contained in the DSP 1. The numerical value of the flag 
register 14 is changed by the changeover of the selector 
switch 3, so that the software 11 changes the expansion rate 
of the speech speed conversion process in accordance with 
this numerical value. 3Q 

FIG. 2 is a view for explaining the speech speed conver- 
sion process which is performed in the DSP 1 in this 
embodiment The speech speed conversion process in this 
embodiment is a method for detecting the pitch (basic 
period) of a voice signal and expanding the length of a 3 3 
waveform with the detected pitch as a unit, in which a voice 
data set of the order of tens of milliseconds (hereinafter 
referred to as "a frame") is made a unit for one process. 
Accordingly, at least two frame length input buffers are 
prepared in the inside of the DSP 1 so that while data from ^ 
the A/D converter is inputted to one buffer, data stored in the 
other buffer is processed (pipe-line process). After being 
processed, data is stored in the output buffer 2 having a 
sufficiently large capacity. The procedure of processing data 
in each frame is as follows. 4 , 

First of all, (a) a pitch extracting process (not shown) is 
applied to the head portion of the frame to thereby detect the 
pitch of this portion. 

(b) Then, the thus detected data of the length of one pitch 
is transferred to the output buffer Z 50 

(c) Then, data of the length of two pitches is multiplied by 
a window function which changes from 0 to 1 and by a 
window function which changes from 1 to 0. 

The on-data positions from which the multiplications by 
the window functions are started are however shifted by one 55 
pitch. Then, data as respective results of the multiplications 
by the window functions are added to each other to generate 
a reproduced wave pattern having the time length of two 
pitches and put the pattern in the rear of the one-pitch data 
which was transferred in advance. 60 

(d) Then, a pitch detecting process (not shown) is carried 
out a gain in the condition in which a position two pitches 
away from the on-data position previously subjected to the 
pitch extracting process is at the head, so that pitch detection 
in this position is performed. Because the voice pitch 65 
generally always varies, a pitch different from the previously 
detected pitch is obtained in the second detection. 
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(e) Data of the length of n pitches is transferred to the 
output buffer with the pitch length obtained by this final 
pitch detection as a unit 

The aforementioned procedure of from (a) to (e) is 
repeated over the whole frame. 

Because the pitch length depends on the input voice, the 
number of repeats in one frame is not constant Further, 
different expansion rates are realized by changing the value 
of n in the aforementioned step (e). For example, data of the 
length of four pitches is generated from data of the length of 
three pitches in the input buffers in the condition of n=l, so 
that the expansion rate becomes 4/3=1.33 times. Similarly, 
the expansion rate becomes 1.50 times in the condition of 
n=0 and 1.25 times in the condition of n=2. 

Further, in this embodiment, the aforementioned speech 
speed conversion process in FIG. 2 is not always applied to 
all frames but the aforementioned process in FIG. 2 is 
applied only in the case where the calculated average power 
of each frame exceeds a threshold Th which was set in 
advance. Data in a frame having power not exceeding the 
threshold Th is therefore transferred to the output buffer in 
its original condition. FIG. 3 shows a concept of this 
threshold process. 

In FIG. 3, the portion in which the power of each frame 
exceeds the threshold Th is expressed as a duration of 
expansion. Because the leading and trailing portions of the 
voice signal are not processed but outputted in their original 
condition by this threshold process, there is an advantage in 
that voice characteristic contained in the leading and trailing 
of the voice, for example, consonantal characteristic, is not 
destroyed. 

Bather, in this embodiment, a second threshold To is 
provided in the threshold process for the average power of 
each frame as shown in FIG. 3. In the case where a frame 
having power lower than this second threshold To is con- 
tinued for a time not smaller than one second, the frame 
having power lower than the threshold To continuously for 
a time not smaller than one second is therefore processed so 
as not to be outputted. Accordingly, reduction in the quantity 
of data stored in the output buffer is attained. 

In FIG. 3, this not-outputted portion is expressed as a 
duration of eumination. At the output buffer 2, data are one 
by one outputted to the D/A converter 6 at regular time 
intervals in parallel with the writing of the speech-speed- 
conversion-processed data at once each frame. Addresses in 
the output buffer 2 are set in the form of a ring so that the 
last address is continued to the first address. 

Accordingly, in this ring-like address space, an operation 
is carried out so that an address pointer Po which points data 
to be fed to the D/A converter runs after an address pointer 
Pi which points the destination of the writing of the speech- 
speed-conversion-processed data. In this embodiment, Pi 
will overtake Po sooner or later because the speed of Pi is 
higher than the speed of Po. At this point of time, informa- 
tion which has been stored in the output buffer 2 is not 
outputted but rewritten. 

Accordingly, the time from the start of the speech speed 
conversion operation to this state becomes a time length of 
the input voice which can be tackled by the speech speed 
conversion process of this embodiment The reduction in the 
quantity of data based on the aforementioned threshold To 
has an effect that this time length which can be tackled is 
made long. 

Further, the signal processing method in the speech speed 
conversion process explained above with reference to FIGS. 
2 and 3 has been reported in "Evaluation of Speech Speed 
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Conversion Method by Hearing-Impaired People" Technical As shown in FIGS. 6 to 10, in the speech speed conver- 

Research Report of Institute of Electronics, Information and sion apparatus of this embodiment, the slow switch 104, the 

Communication Engineers of Japan, SP92-150 (1993-03) or repeat switch 105 and the reset switch 106 are provided in 

4 T)iscussion of Speech Speed Conversion Method using positions where the body 101 of the speech speed conver- 

Portable DSP System" Proceedings of Acoustical Society of 3 s i on apparatus is easy to operate with one hand, for example, 

Japan (March 1993), 1-7-6. in the front upper side portion, and the speech speed 

FIG. 4 is a view showing the form of use of the speech changeover switch 113 is provided in the right plan view, 

speed conversion apparatus according to thisembodimenL The pu^^on me aforementioned slow switch 104 is 

Althou^ FIG. 4 shows the case where the FIT. switch 4 is fonned so as t0 ^ larger than the other pushbuttons because 

disposed on meuppe^ 10 ^ of usimlg of itis higher. Further, because the 

switch 3 for changing the expansion rate of speech speed pushbutton is ^de4 so mat a can be fee^ For examr^ 

conversion is p4ared in a%ide of the FIX. switch 4. mereareuse4(^ 

BecausemesdeXswitch3asweUastheFrLswitch4is „ * \?*** * * >P» shed sM late^ (2) a double 

provided so that the state of the selector switch 3 can be 15 m ^ toc . J 0 *? Wh ^ 1 15 

Observed through the external interruption flag terminal of ^TiV?^?^^^ 

the DSP 1 from me software on the DSP 1, the value of n in iS rcleased when * e resct P****** is pushed, and so on. 

the aforementioned speech speed conversion process is The aforementioned speech speed changeover switch 

changed in accordance with the state of the selector switch (speech speed setting switch) 113 is disposed, close to a 

3 when the PTL switch 4 is pushed. The expansion rate can range allowing the operation thereof with the same finger so 

be changed for every speech by operating the FTL switch 4 that this switch and the slow switch 104 can be operated 

and the selector switch 3 alternately. alternately. 

FIG. 5 shows the aforementioned control procedure Besides the position in the aforementioned embodiment, 

expressed in a flow chart. In the speech speed conversion ^ a ring switch, a slide switch, and so on, may be used to make 

process, for example, a flame of the time length of the order the operation easier. 

of tens of milliseconds is used as a unit of processing. The The aforementioned voice volume 108 is also disposed in 

A/D conversion and D/A conversion are processes which are a range allowing the operation with the same finger so as to 

carried out at regular intervals of a smaller time pitch than be easy to adjust in order to always make hearing In 

that, for example, at regular intervals of a time pitch of the 3Q appropriate voice volume possible, 

order of, for example, tens of microseconds. As shown in Further, a switch which has a feeling of soft touch so that 

FIG. 5, the A/D conversion, the D/A conversion and their mc ^oTophonc 321 does not pick up click noise of the 

attendant process are realized as an interruption process. switch is preferably used as the aforementioned switches 

While the speech speed conversion process and a process of hi^i in the frequency of use, such as die slow switch 104, the 

waiting for interruption are carried out, the mterruption 35 rcpeat switcb 105 me reset switch 1Wf me speec h speed 

process is carried out in accordance with an interruption changeover switch 113, and so on. For example, a switch 

signal from the serial port to which the A/D converter and uging c i ec trically conductive rubber or the like is used, 

the D/A converter are connected. Further, the external appearances of the aforementioned 

As is obvious from the above description, in accordance tivc m ^^ly formed into surface states 

with this embodiment, the speech speed conversion appa- ^ wh £h ^ Wmni ia me fecling m mda: to idcnti fy 

rates can be used notqi^ mc hi kind without seeing, 

a listener like a radio broadcasting voice but also in the 0 

situation of conversation, so that the listener can select the W»» the aforementioned finger stop hollow W3 is 

voice subjected to the speech speed conversion without any a structure is made so that some switches such as a 

disturbance of listener' fowii sf^ch, 45 s ^ ch V°* SWItch * re P eat md 80 on > m 

Further, the speech speed conversion apparatus of this scen * 

embodiment can be used to compensate for the deterioration The internal circuit structure of the speech speed conver- 

of voice hearing ability as observed in the aged or the like. sion apparatus in this embodiment is formed so as to be 

It is further needless to say that the apparatus can be used identical to the aforementioned circuit structure shown in 

even in the situation in which a listener who has no difficulty 50 

in hearing hears an unfamiliar foreign language. As the FTL switch 4 in the previous embodiment, there 

Referring to FIGS. 6 through 10, the external appearance are used the slow switch 104, the repeat switch 105, the reset 

structure of the speech speed conversion apparatus accord- switch 106, and so on, as described above. Further, as the 

ing to the present invention is shown in the following. FIG. selector switch 3 in the previous embodiment, there is used 

6 is a front plan view from the front; FIG. 7 is a back plan 55 the speech speed changeover switch (speech speed setting 

view from the back; FIG. 8 is a top plan view from the top; switch) 113. Further, the speech speed changeover switch 

FIG. 9 is a left plan view from the left; and FIG. 10 is aright (speech speed setting switch) 113 is connected to the exter- 

plan view from the right nal interruption flag terminal 13 contained in the DSP 1. The 

In FIGS. <S to If, the reference numeral 101 designates a numeral value of the flag register 14 is changed by the 

body of the speech speed conversion apparatus; 102, a back 60 changeover of the speech speed changeover switch 113, so 

cover; 103, a finger stop hollow; 104, a slow switch (slow that the software 11 changes the expansionrate of the speech 

pushbutton); 105, a repeat switch (repeat pushbutton); 106, speed conversion process in accordance with this numerical 

a reset switch (reset pushbutton); 321 , a microphone; 108, a value; ^ 

voice volume; 109, an electric source switch; 110, an FIG. 11 is a block diagram showing the functional struc- 

earphone terminal; 111, an external input terminal; 112, an 65 ture of the speech speed conversion apparatus in this 

AV control terminal; and 113, a speech speed changeover embodiment, in which the reference numeral 21 designates 

switch (speech speed setting switch). speech input devices; 22, input buffers; 23, a central pro- 



03/17/2004, EAST Version: 1.4.1 



5,717,818 

15 16 

cessing unit (CPU); 24, a ring buffer memory (which cor- per one data is allocated thereto, and the input buffers 22 are 

responds to the memory 2 in FIG. 1); 25, a function chooser; realized by the allocation of a part of addresses on the 

26, output buffers; and 27, speech output devices. memory 2 shown in FIG. 1. 

> *^ , , i » y The controller 23E shown in FIG, 11 monitors the state of 

Making the constituent parts of tJus embodiment corre- fc esein 22 and transfers speech data of the length 

spond to those of the previous embodiment, the speechinput 3 rf onc ^ t0 ^ cm:0 ^x23A whenever the input buffers 

devices 22 are constituted by the microphone 321, analog ^ are filled with the data of the length of one frame, 

amplifier 10, low-pass filter 7 and A/D converter 5 of FIG. ^ ^ ^^der 23A, the input speech data of the length of a 

1. one frame is subjected to an information compression \ 

The aforementioned input buffers 22 serve to hold a process, so that the data as a result of the compression is held \ 

speech converted into a digital signal by the aforementioned 10 in the ring buffer memory 24. Several methods are consid- \ 

speech input devices 21 and have a size enough to hold data cred as this c ompression process. O ne example thereof is a I 

of the length of one frame which is a unit for signal difference date noiaing metnod shown in FIGS. 12A and / 

processing after that. These input buffers 22 can be realized 12B. FIGS. 12A and 12B are typical graphs for explaining / 

by the allocation of a part of addresses of the ring buffer the compression process in the encoder 23 A in this embodi- 

memory 24 (which corresponds to the memory 2 in FIG. 1). 15 ment 

Ihe aforementioned ^ ou^^= 

^^"^^^ h c^™* each frame. In FIG. 12A, these difference data are expressed 

theDSPlshowninFIG. 1, has an encoder 23A, a silent-pait ™™ ^ ^ ^ rf ^ COSBStt$a ^^ B 

elimination process 23B, a decoder 23C, a wave-farm ^ ^ ^ ohuined ^ mc aforementioned differ- 

manipulation process (speech speed conversion process) cncc dm A1> ^ _ , ^ fa C ode length of 8 bits per one 

23D, and a controller 23E. dividing foe leading data of the frame into upper 

The aforementioned function chooser 25 which corre- g ^its ^ lower 8 bits. One data of the input data has a 

sponds to the portion constituted by the switches 3 and 4 and digital code length of 16 bits. In the case of an input signal 

the external interruption flag terminal 13 shown in FIG. 1, is ^ suc h as a voice signal which changes sufficiently slowly 

constituted by the slow switch 104, the repeat switch 105, compared with the sampling interval, the difference from the 

the reset switch 106, the speech speed changeover switch previous sampling value is however not so large that the 

113, and so on, as described above. difference can be expressed sufficiently in the code length of 

"'^-rRe^toranentioned output Duffers 26 which serve to hold 8 bits which is a half as shown in FIG. 12B. The capacity of 

resulting data processed by the aforementioned wave-form M data after the compression process is therefore about a half 

manipulation process 23D are two in practice and each of * thecapacity of data before the compression process but there 

them has a size enough to store data of the length of one if no missing from the contents thereof as long as the 

" ~7. 64 , , - J* „ M , rfrtilc difference in the middle of the process does not become too 

hne process is realized by usmg mem alternately, whereas in 35 QQjjjpj essB( i into a half capacity for each frame are arranged 

this embodiment a pme-hnepr^ on ^ c ^ buffcr memory 24 so that the time sequence 

output buffers alternately in the same manner as in the th^f ^ Maintained. 

previous embodiment addition to this, a frame header is added to the leading 

That is, while the wave-form manipulation process of one 0 f me compressed data of each frame in order to indicate a 

frame is carried out so that a result of the process is held in ^ break between frames. In the compression process portion, 

one output buffer, a result of the wave-form manipulation me calculation of the sum of the absolute values of all data 

process obtained in the previous cycle is outputted from the m ^ e frame as well as the aforementioned compression 

other output buffer via the speech output devices 27. These process in FIG. 12 is carried out and, at the same time, the 

output buffers 26 can be realized by the allocation of a part WO ik of recording a result thereof as the power value of this 

of addresses of the ring buffer memory 24 (which corre- 45 frame m the aforementioned frame header portion is carried 

sponds to the memory 2 in FIG. 1). ou t. 

The inputting of data to the input buffers 22 and the The determination of a frame to be subjected to the 

outputting of data from the output buffers 26 are carried out waveform expansion/reduction process is performed on the 

at intervals of the sampling rate of the A/D converter 5 and oas is of comparison between the power of the frame and the 

of the D/A converter 6 in the same manner as in the previous 50 threshold Th. Further, the silent-part elfcrunation process is 

embodiment. The process executed by the DSP 1 is therefore carried out on the basis of comparison between the power of 

constituted by a wave-form manipulation process for each the frame and the threshold To. 

frame and an interruption process executed at sampling it is preferable that these thresholds are not used.as fixed 

intervals. values but changed in accordance with the loudness of the 

That is, the interruption process is executed any number 55 input voice. For example, between the case of use in a quiet 
of times while the wave-form manipulation process is room and the case of use in a situation of large backpound 
applied to data of the length of one frame, so that the two noise, speech speed conversion, of course, cannot be per- 
processes are executed apparently and simultaneously. formed well unless these thresholds are adjusted welL 

As the aforementioned ring buffer memory 24, there is In a specific realization method, the maximum /mi nimum 

used a well-known type memory in which writing/reading is 60 values of frame power in the past period of several seconds 

performed for each frame. The details thereof will be are stored so that the aforementioned thresholds are deter- 

described below. mined on the basis of these values. For example, in the case 

(Writing Operation) where these thresholds are to be changed at intervals of five 

In FIG. 11, speech data inputted through the speech input seconds in the condition in which the time length of one 

devices 21 are held in the input buffers 22. The input buffers 65 frame is 50 milliseconds (msec), the process of changing the 

22 have a capacity enough to hold a number of data threshold Th can be carried out once whenever 100 frames 

corresponding to one frame so that the code length of 16 bits are processed. 
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As described above, the power of each frame is always 
calculated with respect to all inputs whenever information 
compression is performed for each frame by the encoder in 
FIG. 11, so that information thereof is recorded in the frame 
header and held in the ring buffer 24. 

In this calculation of frame power, the maximum frame 
power Pmax and the minimum frame power Pmin are 
compared with each other so that they are updated if 
necessary. If the marimnm frame power Pmax and the 
minimum frame power Pmin are provided so as to be reset 
at intervals of five seconds (100 frames), the maximum 
frame power and the minimum frame power in the past 
period of five seconds can always remain. 

In the calculation of the thresholds, for example, Th and 
To are set to 10% and 5% the difference between the 
maximum frame power Pmax and the minimum frame 
power Pmin, respectively. These are given by the following 
expressions (1) and (2). 

Th=lPmax-Pmin\*Q.lCH-Pmm (I) 

Tc^\Pmax-Pmnm.Q$+f > mn (2) 

Although the method of holding raw data in the ring 
buffer memory 24 in this embodiment has been described 
above, the details of silent-part elimination will be described 
below. 

As explained in the previous embodiment with reference 
to FIG. 3, the function of silent-part elimination serves to 
eliminate a silent part (a duration in which power is lower 
than the voice-part/silent-part threshold To) continued for a 
time not smaller than one second. 

The silent-part elimination process is carried out by the 
silent-part elimination process 23B shown in FIG. 11. This 
silent-part elimination process is a process independent of a 
later-described process executed for each frame (hereinafter 
referred to as a main process) so that the process is carried 
out after the main process for one frame is laminated. In 
FIG. 14, the process is carried out between the judgment 
"delay=0r (S143) and the judgment "Power ON?' (S144) 
(though not shown). 

In the silent-part elimination process 23B, data accumu- 
lated in the input buffers 22 are added up at intervals of a 
predetermined unit (for example, l A frame) to calculate 
power, so that the silent-part elimination operation is started 
when the power "crosses the vaice-part/silent-pait threshold 
upwards 9 *. This is because the point of time of termination of 
the silent part is the point of time of the change of power 
from a small value to a large value and, at any point of time 
except this point of time, a judgment cannot be made as to 
whether the silent part continued up to that is longer than one 
second or not 

When the silent-part elimination process is started, first 
the frame header of the ring buffer memory 24 is retrieved 
retroactively to the past Compressed data on the ring buffer 
memory 24 are compressed for each frame and, as described 
above, the power value of the frame is recorded in the frame 
header. If a frame having power lower than To is continued 
for a time not smaller than one* second, silent-part elimina- 
tion is enabled and the input pointer to the ring buffer 
memory 24 is returned to the point of rime in which the silent 
part has been continued for one second. The input of the next 
compressed data is recorded so as to be overwritten from the 
returned point of time. Accordingly, the silent part continued 
for a time not smaller than one second just before the current 
point of time is always eliminated. 
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(Reading Operation) 

The later-described main process in the apparatus of this 
embodiment is carried out for each frame. The wave-form 
manipulation process 23D shown in FIG. 11 therefore holds 

5 currently processed frame data, so that reading from the ring 
buffer memory 24 is performed collectively for each frame. 
That is, because addressing to the ring buffer memory 24 can 
be made easily by a process of increasing the address one by 
one simply in the case where data are collectively picked 

10 out, this case is better in efficiency than the case where data 
are one by one picked out 

Because the data stored in the ring buffer memory 24 are 
compressed data as described above, it is necessary that this 
compression is decoded into the original data before the 

13 wave-form manipulation process. The decoder 23C shown 
in FIG. 11 is provided for this purpose. First, leading two 
8-bit data are arranged in the upper/lower of 16 bits with 
one-frame compressed data as an input to generate a leading 
data. Then, the value of the third data of the compressed data 

20 is added to the leading data to restore the second data. Then, 
the value of the next data of the compressed data is added to 
the second data to restore the third data. Thereafter, the work 
of adding the compressed data to the previously restored 
data successively is repeated thus to restore all data of the 

25 frame. 

The basic operation of the speech speed conversion 
apparatus in this embodiment will be described below in 
brief. 

As shown in FIG. 11, the speech converted into a digital 

30 signal by the speech input devices 21 is first inputted to the 
input buffers 22. The speech signal read from the input 
buffers 22 is fed to the encoder 23A contained in the CPU 
23 of DSP1 (FIG. 1), subjected to the data compression 
process and stored in the ring buffer memory 24. The 

33 aforementioned speech signal is also fed to the silent-part 
elimination process 23B so that the silent-part elimination 
process is applied to the data stored in the ring buffer 
memory 24 if necessary. 
Th&jjata of the speech signal stored in the ring buffer 

40 memory lA are rram e-Dy-rra me fed to the decoder 23CT so 
that the compressea soeecn data are decoded by the decod er 
23C and inputted to the wave-form manipulation process 
(spe ech speed conversion process) 23D. In the wave-tor m 
manipulation process (speech speed conversion process; ) 

45 23D, there is carried out speech speed conversion or the li ke 
onthe basis of the condition set by the function chooser 25. 
The digital speech data subjected to the speech speed 
conversion process or the like are held in the output buffers 
26. The data of the output buffers 26 are read out so that the 

so speech subjected to the speech speed conversion process or 
the like is outputted from the speech output devices 27. 

That is, the data of the output buffers 26 are read out so 
that the data are converted from a digital value into an 
analog value at intervals of a set time by the D/A converter 

53 6 as shown in FIG. 1. The analog signal thus obtained by this 
conversion is inputted to the analog amplifier 9 via the 
low-pass filter 8 and outputted as a voice from the binaural 
headphone 325 in listener's favorite amplitude of speech 
signal. 

60 Referring to FIGS. 11, 13 and 14, the process executed for 
each frame (hereinafter referred to as a main process) in this 
embodiment will be described below. 

FIGS. 13 and 14 are flow charts showing the procedure of 
the main process in this embodiment 

65 As shown in FIG. 13, in the main process in this 
embodiment, the 'fade-in" step is carried out (S131) with 
Powering ON. That is, just after the powering-on of the 
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electric source, data stored in the output buffers 26 are (msec). On the other hand, switcMng devices ^ict . are such 

indefinite. Just after the powering-on of me etedric source, that the pushed states are maintained for a tto not shorter 

Shaving no relation to STspeech may be therefore than the duration ^KlXSS 

J , J . . V\ ^ /„^n***H Jnfcirt where the respective switches (pushbuttons) are pusneo oy 

outputted. In the case where the data « < , Tufer, are used in this apparatus. Accordingly, the situation 

fromthe speech output devices 21 )** ^^^TZ of the routine can be shifted to a desired operation withsuch 
of a very large level. To prevent this, in this embedment, fee f rf ^ reg ^ not gi ven to ^ 

values of data in the output buffers are adjusted by the ^ as £ foe pus hed states of the switches are checked 
execution of the fade-in step so that the output of the speech foe one-frame process is carried out. 

output devices is increased gradually for a predetermined Fjrs ^ whcthcr ^ ^seX switch 106 is pushed down or not 
time after the powering-on of the electric source irrespective 10 {& ^^ed (S138). If the reset switch 106 is pushed down (in 

of the data in the output buffers. Specifically, whenever one ^ case of y es ^ $128), the current mode forcedly goes to 

data is transferred from the output buffers to the VIA ^ through mode at this point of time, 
converter, the value of this data is multiplied by a coefficient, if the reset switch 106 is not pushed down (in the case of 

so that this function is realized by changing the value of the No in S138), whether the slow switch 104 is pushed down 
coefficient with the passage of time. This operation is 15 or aot i s checked (S139) as shown in FIG. 14. If the slow 

executed by the controller 23E shown in FIG. 11. switch 104 is pushed down (in the case of ON: in the case 

Thereafter, the "through mode" process is started. In the of Yes in S13»), the situation of the routine goes back to the 

through mode process, first, the "reading pointer coinci- routine in which parameter is set in the expansion process so 

dence" step is carried out (S132). This reading pointer that the wave-form expansion process is applied to the next 

pointer pointing an output data address onineraary just after ?P ^ ^ ^ K ^ ^ switc h 104 is opened (in the v 

the inputting of data to the input buffers 22. In FIG. 11, this of 0FF: m the case of No in S139), the situation of flic \ 

operation is carried out by the controller 23E. routine goes to the next judgment as to the repeat pushed- \ 

After the reading pointer coincidence, in the through down state (S140). The case where the pushing-down of the \ 

mode, the pushed states (ON states) of the slow switch 104 30 repea t s jgftcfelOS b detected at this point of tune is either the I / Q rv ^ 

and the repeat switch 105 are checked (S133 and S144). In whereSherepeat switch is pushed atthe time of repeat I \X Q/ 

the case where both switches are in non-pushed states (OFF reproduction" or the case where "the repeat switch is pushed \ 

states), the situation of the routine goes back to me previous at me 0 f catching-up reproduction". In either case, the 1 

reading pointer coincidence step (S132) so that the through situation of the routine branches into me reading pointer I 

mode is continued. Accordingly, in the interruption process 35 return routine so that the repeat reproduction is started from / 

which occurs while the through mode is continued, input the silent part near a position returned back to the past by / 

data is always outputted intact, so mat the same speech as the about five seconds from me current position of the output^/ 

input speech is outputted from the speech output devices 27. pointer of the ri ng buffer memory 24. TTZr 

In FKJ. 11, the aforementioned respective switches such "Tn the case where the slow switch 104 is opened and fte 

as the slow switch 104, the repeat switch 105 and the reset 40 repeat switch 105 is not pushed down, the situation of the 

switch 106 are contained in the function chooser 25 and the routine goes to the following repeat end judgment (S141). 

states thereof are checked by the controller 23R The repeat operation is continued until the ouh?ut pointer 

When the repeat switch 105 is pushed (turned ON) in the goes back to me output pointer position where the through 

aforementioned through mode, a repeat flag (not shown) mode was changed to me repeat operation by the vmt- 

Se^d separately is set from 0 to 1 and the "reading 45 down of the repeat switch 105. That is, in the case where this 

pointer return" routine is carried out (S135). FIG. 16 shows judgment shows that Ihe repeat mode is used currently and 

allow chart of the internal procedure of this reading pointer that the position of the output pointer does not yet go back 

return routine. The explanation of FIG. 16 will be described to the output pointer position whew the repeat was started, 

later a processing loop is formed so mat the situation of the 

When the slow switch 104 is pushed in the aforemen- 50 routine goes back to the aforementioned one-frame wave- 

tioned through mode, the situation of the routine skips to the form expansion/reduction process. The subsequent process 

routine for "setting parameter in expansion process" (S136) is a process for catching-up reproduction, 

as shown in FIG. 13. FIG. 17 shows a flow chart of the After the repeat reproduction is terminated ca ^atter Jhe 

internal procedure of this routine. The explanation of FIG. slow reproduction is terminated, the situation of toe routine 

17 will be described later. « goes to the catching-op reproduction The ca^g-up 

After trie setting of parameter in the expansion process is reproduction means an operation in which a time lag from 

performed, the one-frame waveform expandonfteduction the real time as caused by the repeat or slow reproduction is 

process is carried out (S137). FIGS. 18 and 1* show flow made up for by fast reproduction realized by the repetition 

charts of the internal procedure of this one-frame waveform of the one-frame waveform reduction process. Inthe ^process 

expansionprocess.TheexplanationofFIGS.18andl9wm <o in this portion, the setting of parameter is prformedfor the 

be described later. waveform reduction process for the catching-up reproduc- 

After the aforementioned one-frame process is completed, Hon (S142). 

the situation of the routine goes to the step for checking the The quantity of the lag from the real time increases when 

states of the respective switches as to whether each switch the repeat button is pushed down or when the waveform 

is pushed or not Hereupon, because the one-frame process 65 expansion process is earned out. On the contrary, it 

is terminated within the time length of one frame, the decreases when the waveform reduction process is carried 

process is completed in the order of tens of milliseconds out 
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The process for increasing/decreasing this quantity of the FIG. 16 is a flow chart showing the procedure of the 

lag is however not shown in FIGS. 18 and 19 which show reading pointer return routine. 

the procedure (flow chart) of the one-frame waveform The reading pointer return routine in this embodiment is 

expansion/reduction process which will be described later. a specific method for changing the value of the output 

Ajudgrnentismadeastowhettate 5 pointer pointing the position of data to be read from the ring 

from the real time is present or absent (S143). In the case DU fre r 24, which method is necessary for realizing a repeat 

where me quantity of me time hg is function. 

loop is formed so mat the catching-up reproduction is M sn0 wn in FIG. 16, first, the position of the output 

continued. That is, the operation in which the catching-up pointer at the current point of time is set to Pout (S161). 

reproduction is continued until the quantity of the time lag 1Q ^ quan tity 0 f the lag from the real time at the current 

becomes zero, is realized by this judgment mt 0 f time is set to D (S162). 

On the other hand, in the main process described above, * A j udgment & ^ to whether the quantity of the lag 

the time lag from the real time as caused by the speech speed ^ tf ^ amA int rf ^ w mat me 

conversion or repeat operation is managed as lag quantity ^ ^ Jflg ^ cxcccd mc ^ of ^ ^ bu ffer 

by using a counter, , M . 13 memory 24 if the quantity of the lag is further increased by 

Although the time lag from the real time can be ?SdsW) (SI©). lithe case 4ere a decision is made 

also as difference between the position on the ncg buffer 24 J awwuua \*> j\s> **>j ... , u 

where the current sampled datais inputted and the position as a result of the payment so that the q^nhty of &e lag 

on the ring buffer 24 where the position of data outputted is exceeds the size of the ring buffer memory 24 (the case of 

inputteMhat is, as difference between addresses pointed by Yes in S163), this routine is terminated without any change 

two pointers, the management method using the lag quantity 20 of Pout and D (S169 and S170). 

counter as described above is .employed in the present In the case where the quantity of the lag can be increased 

invention. This is because the quantity of the lag may be by five seconds (B5) (the case of No in S163), the pointer is 

unable to be expressed correctly in the address difference returned back by five seconds (-B5) and the quantity of the 

between the input and output pointers on the ring buffer 24. lag is increased by five seconds (+B5) (S164). 

For example, assuming that the memory address space 25 Then, a process of searching back for the silent part is 

allocated to the ring buffer 24 is from address 0 to address started so that the start of the repeat is made a pause of the 

1000, then the ring buffer 24 is realized by the handling of speech. First, data is accessed backward from the position 

the memory address space in a manner of "next to address pointed by Pout on the ring buffer memory 24 to thereby 

1000, jump to address 0" in the program. Therefore, in the ulaAatt one-frame power (S165). 

case where the input and output pointers lie across this break ^ Mfids ^ mc output pointer is returned back (-F) by 

between addresses, the quantity of data therebetween cannot framc _ ^ tity of the lag is also further increased 

be expressed easily by taking the difference between address ^ ^ a judgment is made as to whether 

tome speech speed conversion apparatus according to the the lag exceeds the size of the ring buffer memory (the case 

present inwntion^wnenever readingAvriting of data is per- of Yes in S166), this search forthe silent part is stopped and 

formed with respect to the ring buffer 24, the value of the lag Pout and D at this time are set as the output pointer vale Mid 

quantity counter is changed to thereby manage the quantity 40 the lag quantity respectively (S169 and S170) whereafter 

of the time lag to prevent me increase of the quantity of this routine is terminated. 

processing based on the complex address calculation. In the case where the total quantity of the lag does not 
The aforementioned main process is provided in the form exceed the size of the ring buffer memory (the case of No in 
of an infinite loop in which the aforementioned process is S166) though the output pointer is returned back by one 
repeated until the electric source switch is turned off (S144). 45 frame, Pout is returned back by the length of one frame and 
In the case where me electric source switch is turned off, the quantity D of the lag is increased by one frame (S167) 
the process is not suddenly stopped but continued for a whereafter the calculated one-frame power W and the voice- 
predetermined time before it is stopped (mute) (S14S). part/sfien^part threshold are compared with each other 
During this time, here is carried out such a process that the (S168). In the case where the one-frame power W is smaller 
loudness of the output voice is reduced gradually. 50 than this threshold, a decision is made that a pause of the 
Specifically, in an interruption process in the same manner speech is present near this frame (the case of No in S168) 
as in the fade-in operation which is the first step, whenever and Pout and D at this time are set as the output pointer value 
one data from the outout buffers 26 is transferred to the D/A and the lag quantity respectively (S169 and S170) where- 
converter 6 shown in FIG. 1, the value of this data is after this routine is terminated. 

multiplied by a coefficient so that this function is realized by 55 In the case where the one-frame power W is larger than 

changing the value of this coefficient wilh the passage of this threshold (the case cf Yes in S168), the pointer is further 

time. This operation is carried out by the controller 23E returned back by one frame and the search for the silent part 

shown in FIG. 11. * s continued to detect the silent part in the same manner as 

FIG. 15 is a state transition view typically showing described above but the search is continued until the quan- 

transition between respective modes in this embodiment as 60 tity of the lag exceeds the size of the ring buffer memory, 
described above. The way of mode switching on the basis of Thus, the output pointer return process at the time of the 

the switching operation will be understood well from FIG. pushing of the repeat switch 105 is completed. 

15. Further, the standby mode in FIG. 15 will be described FIGS. 17 and 18 are flow charts showing the procedure of 

later in detail. the one-frame waveform expansion/reduction process in this 

The details of processing operations in the respective 65 embodiment, 

routines in this embodiment described above will be In the one-frame waveform expansion/reduction process 

described below in detail in this embodiment, as shown in FIGS. 17 and 18, first of all, 
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power of current one-frame data is calculated (S171). Then, Then, the pitch extraction process is earned out again 

this power value P is compared with the threshold Th (S183). This is a process which is adapted to the fact that the 

(S172). A frame having higher power than the threshold Th human voice pitch always varies and in which the error 

is subjected to the following process. Data in a frame having between the actual pitch length and the pitch length for 

lower power than the threshold Th may be outputted intact 5 processing is reduced by extracting the pitch again to 

to be transferred to the ring buffer 24 (S173) or may be thereby consequently prevent the increase of distortion in 

subjected to a consonant emphasis process and then trans- waveform after expansion/reduction, 

f erred to the output buffers 26. Whether consonant emphasis Then, as shown in FIG. 18, the quantity of data twice as 

is to be performed or not to be performed is determined by much as the newly extracted pitch is compared with the 

mestateofmemodeswitchwWchisoneofhioVienswitches. number of not-yet-processed data (S184). If the quantity of 

As a specific method for realizing the consonant emphasis data erf the length of two pitches does not remain (the case 

process, there is, for example, considered a method in which of No in S184), this process is stepped immediately, 

a frame having lower power than the threshold Th just prior If the quantity of data larger than the length of two pitches 

to a frame having higher power than the threshold Th is remains (the case of Yes in S184), a post-transfer process is 

regarded as a consonant and the values of data in the frame carried out The post-transfer process means a process 

are increased. 15 similar to the pre-transfer process and corresponds to the 

In the case of a frame having higher power than the portion of (e) in FIG. 2 in the previous embodiment. The 
threshold Th in the aforementioned power judgment (the number of data to be transferred by the post-transfer process 
case of Yes in S172), first the number of data in one frame is set with the pitch as a unit but the number thereof varies 
is stored in a variable Z indicating the quantity of not-yet- in accordance with the waveform expansion/reduction rate, 
processed data (S174) and then the pitch extraction process 20 The number Npf is set (S185) by the parameter setting 
is carried out from the leading of the frame (S17S). Several routine which will be described later with reference to FIG. 
methods are considered as the pitch extraction process. For 19. After the pre-transfer process (S18tf), the quantity Z of 
example, the pitch length at the leading of the frame is not-yet-processed data is reduced by the number of trans- 
extracted on the basis of a well-known algorithm using f erred data (S1S7). 

autocorrelation. 25 The aforementioned procedure is continuously repeated 

Then, the quantity of data corresponding to twice the thus un til this procedure is stopped on the basis of comparison 

extracted pitch length is compared with the quantity of between the quantity of data of the length of two pitches and 

not-yet-processed data (S176), and in the case where the tne quantity of not-yet-processed data which comparison is 

quantity Z of not-yct-processed data is smaller the quantity pcrfarmed twice in the middle of this procedure, 

of data twice as much as the extracted pitch, this process is M mQ 19 {s a flow ^o^g mc procedure of the 

stopped. . „ , _ . . ^ parameter setting routine for setting parameter for the 

In the case where the quantity Z of not-yet-processed data F . 5 . ... K v 

ju uk, W uw « m / / r expansion process in this embodiment 

is eaual to or more than the quantity or data twice as mucn v At _ ... , . 

Sthe«ttactedpitch(thecaseofYeTm * 

process is carried out (S178). The pre-transfer process » « twice in the main > proc«s show m HGS. 13 and 

means a process in which a part of mput data is transferred 35 " Once thereof is used just before the aforementioned 

intact to the output buffer 26 before a reproduced wave one-frame waveform expansion/reduction routine and the 

pattern insertion process which will be described later. The other once is used in a "process for setting parameter for the 

pre-transfer process corresponds to the portion of (b) in FIG. reduction process" after the repeat end judgment 

2. The number of data to be transferred by the pre-transfer Hereupon, the waveform reduction process is a process 

process is set with the pitch as a unit but the number thereof 40 forrealizmgfoe"catehmg-u^ 

varies in accordance with the wave-form expansion/ which is continued after slow hearing or after repeating, 

reduction rate. The number Npf is set (S177) by a parameter When the generation of a reproduced wave pattern with use 

setting routine which will be described later with reference of the A window function as carried out in the waveform 

to FIG. 19. After the pre-transfer process is carried out expansion process is carried out while the position subjected 

(S178), the quantity Z of not-yet-processed data is reduced 45 to the window function is shifted in a direction reverse to the 

by the number of transferred data (S179). case of expansion, waveform reduction is obtained. 

Then, the position of application of a A window function In FIG. 19, first a discrimination is made between expan- 

for generating a reproduced wave pattern is determined sion and reduction (S191). This discriminates one of the 

(S180) in accordance with another parameter Ptri set in the aforementioned twice from the other, 

parameter setting routine shown in FIG. 19. What differs 50 In the case of parameter setting for the expansion process, 

between expansion and reduction is only the position on after this discrimination, the position of the speech speed 

current wave to which the window function is applied in the selection switch is checked (S192), the expansion rate e is 

case where a reproduced wave pattern is generated by using set in accordance with the position of the switch (S193), the 

the A window function. positions of parameters Npf and Npr used in the waveform 

That is, in the case of waveform expansion, as shown in 55 expansion process are set in accordance with the expansion 

HG. 2, the A window function is applied so that waveform rate e, and parameter Ptri mdicating the position of the start 

of the length of two pitches is generated from waveform of of weighted suirimation with respect to the A window as 

the length of one pitch (S181). Contrariwise in the case of carried out in the waveform expansion process is set, where- 

waveform reduction, as shown in FIGS. 20 to 22, the A after this routine is terminated. 

window function is applied so that waveform of the length 60 On the other hand, in the case of parameter setting for the 

of two pitches is generated from waveform of the length of reduction process, the right flow in FIG. 19 is carried out 

three or four pitches. The quantity of the lag from the real First, the position of the catching-up mode switch (which is 

time is changed by the insertion of the reproduced wave one of hidden switches) is checked (S196) and which of 

pattern (though not shown). "jump", "fast hearing" and "one-fold" the catching mode 

After the reproduced wave pattern insertion process, the 65 (Meat) is set to is checked (S197 and S198). 

quantity Z of not-yet-processed data is reduced by the When set to "jump", the catching mode (Meat) practically 

number of the thus processed data (S182). serves not to "catch up" but to jump actually just at the 
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the basis of comparison between the one-frame power and Further, when a lever is provided so that force : acts to 

the thresholdTh. Escape out of the continuous speech speed return the lever to the cento automatically when the user 

conversion mode is achieved by the pushing-down of the releases the finger's hold of this accelerator type switch, it 

reset switch 106 becomes easy for the user to keep the slide switch in another 

In this enibodimcnt, the speech is made slow or fast in 5 intermediate position than the center, so that an operating 

accordance with the power thereof. method easier to handle can be realized. Hereupon, the 

In ordinary conversation, there is generally a tendency production of the force to return this lever to the center can 

that an hnportant portion which must be told to a listener is be realized by two springs which are provided in the inside 

louder-voiced but a portion not so important is smaller- of a switching device to give thereto a mechanical means 

voiced. Accordingly, the speech speed control in this 10 such as means of pulling the lever by uniform force from the 

embodiment is characterized in that an output voice nearer opposite sides, and so on. . « 

to the natural voice is obtained. Referring next to FIG. 28, there is a block diagram 

The probability of appearance of the high-power portion showing the functional structure of a speech speed conver- 

and the probability of appearance of the low-power portion sion apparatus provided with an AV control means. Refer- 

are however not always equal to each other, so that catching is ring to FIG. 29, there is a view for explaining me oration 

up to the real time at intervals of a predetermined time as in of the AV control means in this embodiment of FIG. 28. 

the case of the previous embodiment of FIGS. 23 and 24 is Referring to FIGS. 30 and 31, there are flow charts showing 

not always ensured. operating procedure of the rnain process m the speech 

Further, as a method of instruction from the user to attain speed conversion apparatus provided with the AV control 

entry into the continuous speech speed conversion mode, 20 means in this embodiment. 

there are considered a method in which the slow switch As shown in FIG. 28, the speech speed conversion 
(slow pushbutton) 104 is pushed and then slid laterally to apparatus provided with the AV control means in this 
thereby be locked, a method in which the slow switch (slow embodiment is provided as a functional structure in which 
pushbutton) 104 is double-cHcked (pushed down twice in an AV controller 28 is added to the functional structure of the 
succession at a short time interval), and so on. If these 25 speech speed conversion apparatus in the aforementioned 
methods are used, the respective intentions of "executing embodiment shown in FIG. 11 and connected to the con- 
slow reproduction" and of "continuing" the operation by the troller 23E. u *u 
pushing of the slow switch (slow pushbutton) 104 can be The aforementioned controller 23E judges whether a 
expressed in difference in the way of pushing of the same condition for outputting an AV control signal is satisfied or 
pushbutton so that there can be provided an operating 30 not and operates the AV controller 28 to start/stop the 
system which is more intuitive and easier to understand outputting of the AV control signal 
compared with the case where a continuous speech speed As shown in FIG. 29, the AV control means is a software 
conversion pushbutton is provided separately. in which the AV control signal is outputted when the 
The embodiments up to now have been described above quantity of the lag from the real time as caused by slow or 
upon the assumption that the waveform "expansion rate" in 35 repeat reproduction exceeds a predetermined value (30 
thT case of "slow" reproduction based on the waveform seconds in FIG. 29) and in which the outputting of the same 
expansion process is determined by the setting of the signal is stopped when the lag quantity then reaches zero via 
"speech speed setting switch" provided on the apparatus and catching-up reproduction. 

that a "default value" determined (in the program) in TneAV control signal is picked out of mis apparatus and 

advance is used as the waveform "reduction rate" in the case 40 used for temporarily stopping the reproducing operation of 

of "fast" reproduction based on the waveform reduction a recording/reproducing apparatus such as a tape recorder a 

^ occss video tape recorder, or the like. By this means, it is made 

The function of "making freely coming and going on the possible to continue the slow hearing of an input voice 

time axis of the speech possible" provided by this apparatus, which is continued for such a long time that exceeds the 

however, can be used by the user more intuitively when an 45 capacity of the ring buffer 24 in this apparatus. ^ 

"accelerator type switch" shown in FIG. 27 is provided so In FIGS. 30 and 31, the portion surrounded by the fcoken 

mat the waveform expansion/reduction rate is changed by line is a step showing the operating procedure of the AV 

^ switch control means added to the flow charts in FIGS. 12 and 13. 

When the accelerator type switch is set to the center, the In this step, a judgment is made as to whether the condition 

through mode in the aforementioned embodiments is 50 to outputting the AV control signal is satisfied or not (S301). 

executed. When the slide switch is pulled to the front, the The judgment with respect to the outputting of the AV 

waveform expansion process is employed so that "slow control signal is realized by a judgment as to whether the 

reproduction" is executed with a lag from the real time. quantity of the lag from the real time in a loop in which the 

When the slide switch is then pushed to the back, the one-frame waveform expansion/reduction process is 

waveform reduction process is employed reversely so that 55 repeated for slow or repeat reproduction is over 30 seconds 

fat reproduction is executed (until the lag from the real time or not (S301) and by starting the outputting of the AV control 

reaches zero). signal when the lag from the real time is over 30 seconds 

During this, the controller changes the waveform (S302). 

expansion/reduction rate in accordance with the distance On the other hand, the process of stopping the AV control 

from the center of the slide switch. As is obvious from the 60 signal is carried out just after the judgment "lag quantity^ 

explanation of the aforementioned embodiment of FIGS. 20 Or which is a judgment for escape out of the loop of the 

through 22, the expansion/reduction rate however can be set catching-up reproduction process (S303). 

to no value but several values which can be expressed in Referring next to FIGS. 32A, 32B and 32C, there are 

integer rates. In practice, therefore, the expansion/reduction views for explaining the arrangement of a microphone in a 

rate may be preferably set so that several stages of values 65 speech speed conversion apparatus according to the present 
can be selected in accordance with the distance from the invention. The reference numeral 101 designates a body of 

center of the slide switch. &e speech speed conversion apparatus; 321, a microphone; 
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moment that the hold of the slow switch (slow pushbutton) is checked (S237). In the case where the lag quantity is zero 
Is released (S199). Specifically, a branching process for (the case of Yes in S237), the situation of the routine goes 
forcedly returning back to the through mode is carried out in back to the step S232. In the case where the lag quantity is 
this portion. not zero (the case of No in S237), parameter is set for the 

When the catching-up mode (Meat) switch is set to 3 expansion process (S238) and the frame counter is reset 
"one-fold", the reduction rate s is set to one-fold (S200) and (S239) whereafter the situation of the routine goes back to 
the situation of the routine goes to step 202. the step S232 so that (he continuous speech speed conver- 

When the catching-up mode (Meat) switch is not set to sion operation is repeated. In the case where the continuous 
"one-fold", me reduction rate sis set through the center flow speech speed conversion process is not intended in the 
in FIG. 19 at the time of the catching-up mode (S201), the 10 aforementioned step S231 (the case of No in S231), the 
values of parameters Npf and Npr used in the waveform mode is shifted to the aforementioned main process routine 
reduction process are set in accordance with the reduction (through mode). 

rate s (S202) and, further, parameter Ptri indicating the That is, the continuous speech speed conversion means in 
position from wtech wdghted suniination with respect to the this embodiment is a method in which slow reproduction 
A window as carried out in the waveform reduction process 15 and catching-up reproduction are repeated alternately at 
is started is set (S203), whereafter this routine is terminated. intervals of a preliminarily set time. According to this 

FIGS. 23 and 24 are flow charts showing the procedure of method, catching up to the real time at intervals of a 
the total operation of a speech speed conversion apparatus predetermined time is always made possible. The manage- 
provided with a continuous speech speed conversion means ment of the changeover between waveform expansion and 
according to the present invention. 20 waveform reduction is performed on the basis of the count 

As shown in this embodiment of FIGS. 23 and 24, of the number of frames. For example, when the expansion 
continuous speech speed conversion in the speech speed process for a number of frames corresponding to about five 
conversion apparatus provided with the continuous speech seconds is completed, the reduction process is then carried 
speed conversion means is substantially an operation in out repeatedly, and when the lag quantity reaches zero, the 
which the pushing of the slow switch (slow pushbutton) 104 25 frame count is returned to zero and the expansion process is 
is continued so that slow reproduction is continued. The time repeated again. 

lag is however accumulated rapidly when waveform expan- Further, escape out of the continuous speech speed con- 
sion at a constant waveform expansion rate is continued, so version mode is achieved by the pushing-down of the reset 
that the quantity of the lag from the real time finally exceeds switch 106 to return the mode to the through mode, 
the capacity of the ring buffer 24 to make it impossible to 30 Referring next to FIGS. 25 and 26, there are flow charts 
continue slow hearing any more. showing the procedure of the total operation of a speech 

The continuous speech speed conversion means is there- speed conversion apparatus provided with a continuous 
fore provided to mix a waveform expansion period and a speech speed conversion means different from that in the 
waveform reduction period reverse thereto at the time of embodiment shown in FIGS. 23 and 24. 
slow reproduction so that the lag from the real time is not 35 The continuous speech speed conversion in the speech 
increased rapidly. speed conversion apparatus provided with the continuous 

Although several methods are considered as means for speech speed conversion means in this embodiment is an 
changeover to the continuous speech speed conversion operation for applying waveform expansion to a frame of 
mode, it is rather easy to understand that a clear distinction high power and applying waveform reduction to a frame of 
is made between the case where the pushing of the slow 40 low power. 

switch is continued simply for a considerably long time and In the continuous speech speed conversion means in this 
the case where entry into the continuous speech speed embodiment, whether the continuous speech speed conver- 
conversion mode is intended Accordingly, this changeover sion process is intended or not is checked in step S251 in 
is realized, for example, by using switching parts by which FIGS. 25 and 26. If the continuous speech speed conversion 
the slow switch is locked when double-clicked (pushed 45 process is intended (the case of Yes in S251), a judgment is 
twice at a short time interval) or when slid laterally while made as to whether the reset switch 106 is pushed (turned 
pushed. on) or not (S252). In the case where the reset switch 106 is 

The respective steps in the flow charts shown in FIGS. 23 not pushed (turned off), one-frame power is calculated 
and 24 in this embodiment are quite the same as in the (S253). Then, whether the calculated one-frame power is 
procedure of the main process described above with refer- 50 higher than the threshold Th or not is checked (S254). In the 
ence to FIGS. 13 and 14. case where the calculated one-frame power is lower man the 

In the continuous speech speed conversion means in this threshold Th (the case of No in S251), parameter is set for 
embodiment, whether the continuous speech speed convex- the reduction process (S256) and the situation of the routine 
sion process is intended car not is checked in step S231 in goes to step S257. In the case where the calculated one- 
HGS. 23 and 24 (S231). If the continuous speech speed 55 frame power is higher than the threshold Th (the case of Yes 
conversion process is intended (the case of Yes in S231), the in S254), parameter is set for the expansion process (S255) 
one-frame waveform expansion/reduction process is carried and the one-frame waveform expansion/reduction process is 
out (S232). Then, a judgment is made as to whether the reset carried out (S257) whereafter the situation of the routine 
switch 106 is pushed (turned on) or not (S233). In the case goes back to the step S252 so that the continuous speech 
where the reset switch 106 is not pushed (turned off), 60 speed conversion operation is repeated. In the case where the 
counting up by one frame is performed (S234) and a continuous speech speed conversion process is not intended 
judgment is made as to whether the expansion period is in the aforementioned step S251 (the case of No in S251), 
intended or not (S235). If the expansion period is intended the mode goes to the aforementioned main process routine 
(the case of Yes in S235), the situation of the routine goes (through mode). 

back to the step S232. If the expansion period is not intended 65 That is, when entry into the continuous speech speed 
(the case of No in S235), parameter is set for the reduction conversion mode is made, one-frame power is calculated so 
process (S236). Then, whether the lag quantity is zero or not that either expansion or reduction is applied to each frame on 
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322, a prop capable of expansion and contraction for sup- of the speech speed conversion apparatus body shown in 
porting the microphone 321; 323, a flexible prop for sup- FIG. 6 and by the display of the display screen as shown in 
porting the microphone 321; and 324, an electric cord for FIG. 34 on the liquid crystal display. Further, this display 
electrically connecting the microphone 321 to the speech portion is controlled by a "liquid crystal display driver" (not 
speed conversion apparatus body 101 by wire. 5 shown) connected to the controller 23E in FIG. 11. 

FIG. 33 is a view showing a modified example of this Because the quantity of the time lag to be displayed is 
embodiment of FIG. 32, in which the reference numeral 101 continuously managed by the lag quantity counter in the 
designates a body of the speech speed conversion apparatus; main process shown in FIGS. 13 and 14, the numerical value 
104 a slow switch; 105, a repeat switch; 106, a reset switch; of this lag quantity counter can be converted at the conver- 
321, a microphone; 324, an electric cord for electrically 10 sion rate of one to 10 seconds so that a corresponding 
connecting the microphone 321 to the speech speed conver- number of human images can be indicated on the aforemen- 
sion apparatus body 101; 325, an earphone; and 300, a tioned display. This displaying operation is earned out by 
connection member. the controller 23E in FIG. 11 through the aforementioned 

In the arrangement of the microphone in the speech speed display driver and the timing of rewriting the display is 
conversion apparatus of this embodiment, as shown in FIG. 15 sufficient as long as the rewriting is performed whenever the 
32A, the microphone 321 is supported by the prop 322 processing of one frame is completed. For example, this 
capable of expansion and contraction. Because the afore- displaying process is carried out between the steps S137 and 
mentioned supporting of the microphone 321 makes the S138 in FIG. 14. 

microphone 321 far away from the speech speed conversion Referring to FIG. 35, there is a view for explaining an 
apparatus body 101, the rustle of clothes can be prevented 20 electric source device in a speech speed conversion appa- 
from being produced when the apparatus body is put into a ratus according to the present invention. The reference 
breast pocket in use. numeral 1000 designates an apparatus portion concerning 

Alternatively, as shown in FIG. 32B, the microphone 321 the speech speed conversion apparatus; 1, a DSP; 5, an A/D 
is supported by the flexible prop 323. Being supported by converter; 6, a D/A converter, 9, an analog amplifier; 10, an 
such a manner, the microphone 321 is separated from the 25 analog amplifier; 1001, an electric source; 1002, an electric 
speech speed conversion apparatus body 101, and can be power supply line; and 1003, a changeover switch, 
bent in a desh*ed direction. Accordingly, the rustle of clothes In the speech speed conversion apparatus in this 
can be prevented from being produced when the apparatus embodiment, as shown in the state transition view of FIG. 
body is put into a breast pocket in use. 15, a standby mode is provided besides the through mode so 

Further, as shown in FIG. 32B, the microphone 321 and 30 that entry into the standby mode is made automatically when 
the speech speed conversion apparatus body 101 are elec- the through mode is continued for a predetermined time, 
trically connected to each other by wire (or wireless). The That is, when either slow switch or repeat switch is pushed 
STS ratio can be improved because the microphone 321 and (turned on), clock frequency is heightened so that each 
the speech speed conversion apparatus body 101 are elec- process is carried out. 

trically connected to each other by wire (or wireless) as 35 Further, in the through mode, the DSP 1 operates with fast 
described above so that the microphone 321 is disposed near clock but power is wasteful because the speech speed 
the listener independently of the speech speed conversion conversion process or the like is not executed. In the standby 
apparatus body 101. mode, therefore, the operating clock for the DSP 1 is 

Further, as shown in FIG. 33, the speech speed conversion lowered so that only I/O of data is performed to thereby 
apparatus body 101 and the microphone 321 are electrically 40 reduce consumed electric power. Further, only storage into 
connected to each other by the electric card through the the memory is performed. In this manner, a voice memory 
earphone 325 and the connection member 300. Further, function is realized. 

operation switches such as the slow switch 104, the repeat Further, as shown in FIG. 35, at the time of an analog 
switch 105, the reset switch 106, and so on, are provided on through mode, the changeover switch 1003 is connected to 
the aforementioned connection member 300. In mis manner, 45 a contact side to cut off the electric power supply line 1002 
not only the rustle of clothes can be prevented from being and also connected to a contact side to connect the analog 
produced when the apparatus body is put into a breastpocket amplifiers 10 and 9 directly so that electric power is not 
in use but also both the S/N ratio and the handling property supplied to the DSP 1, the A/D converter 5, the D/A 
can be improved. converter 6 and peripheral digital circuits. At this time, the 

Referring next to FIG. 34, there is a view for explaining 50 storage into the memory is not performed. That is, VO 
a lag time display means in a speech speed conversion analog systems are connected directly so as to be operated 
apparatus according to a furmer emrx>diment of me present simply as an analog amplifier. The aforementioned 
invention. The reference numeral 341 designates a display changeover switch is provided as a switch of three stages, 
portion; and 342, a display screen. namely, an ON stage, an OFF stage and an ON-OFF 

As shown in FIG. 34, the lag time display means in this 55 intermediate stage as shown in FIG. 35, so that the analog 
embodiment displays how much the speech of a speaker is through mode is provided. 

delayed from the real speech speed at the time of the As is obvious from the above explanation, in accordance 
aforementioned slow/repeat reproduction. For example, with this embodiment, a switch of three stage consisting of 
assuming that one human image represents the time lag of 10 an ON stage, an OFF stage and an ON-OFF intermediate 
seconds in FIG. 34, then the time lag from the current time 60 stage is formed so that the analog through mode is provided, 
is expressed in the number of displayed human images. In Accordingly, not only reduction in electric power can be 
mis manner, me quantity of time lag from the current time attained but also the range of use of the electric source can 
is recognized visually. Accordingly, both speaker and lis- be widened. 

tener can adjust the speech speed conversion easily, so that Referring to FIG. 36, an embodiment in which the speech 
this apparatus can be used so as to be easy to handle. 65 speed conversion means according to the present invention 
The visual display of the time lag is realized, for example, is applied to a telephone will be described. The reference 
by the provision of a liquid crystal display in the front center numeral 2000 designates the speech speed conversion means 
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according to the present Invention; 3000, a body of the (7) Aplurality of persons hear one source simultaneously at 

telephone; 3001, a transceiver; and 3002, a telephone line. their favorite speech speeds. 

As shown in FIG. 36, the telephone in this embodiment is Further, in combination with a digital audio apparatus 

formed by inserting the speech speed conversion means such as a tape recorder, a CD recorder, an MD recorder, and 

2000 according to the present invention between the handset 5 so on, it is unnecessary that any A/D converter is provided 

3001 and the telephone body 3000. The speech speed in the speech speed conversion apparatus as long as the 

conversion means 2000 is, for example, shaped like a mount audio apparatus has a digital output 

on which the telephone body 3000 is put Further,in the music purpose, the present invention can be 

Further, in the case of a cordless handset or cordless child applied as long as changes are made as follows, 

transceiver 3001, the speech speed conversion means 2000 10 The judgment based on the power of an expanded frame 

is inserted between the transceiver 3001 and the telephone is not carried out (because tempo is disordered), 

body 3000 by wireless. The pitch extraction range is widened compared with the 

Further, the speech speed conversion means according to case of a voice, 

the present invention may be used as a speech speed The waveform expansion process is earned out on the 

conversion means in a switching system so that it can be 15 basis of the pitch of a fixed length. In the case of a voice 

operated at the user's request thereof, the pitch is detected so that processing is made on 

In the aforementioned structure, a voice can be heard over the basis of the detected pitch, 

the telephone slowly. Further, because the voice is fed back A foot switch is provided so that a converting operation 

as a through voice to the speaker side as well as the voice can can be carried out by the foot switch. This makes it possible 

be heard slowly to the listener so that the speaker can speak 20 to control a music instrument while playing, 

ordinarily at the time of telephone conversation with the Although the present invention has been described spe- 

aged or the like, there is no fear of hard speaking. cifically on the basis of the enibodiments thereof, it is a 

Further, it is unnecessary that any A/D means is provided matter of course that the present invention is not limited to 

in the inside of the speech speed conversion means as long the aforementioned embodiments and that various changes 

as the speech speed conversion means is provided as a 25 may be made without departing from the spirit thereof, 

digital circuit. In brief, effects obtained by typical embodiments of the 

Referring to FIG. 37, description will be made as to an present invention disclosed in this application are as follows, 

embodiment in which the speech speed conversion means (1) Because the speech speed conversion apparatus can be 

according to the present invention is applied to a premises used not only for a voice one-sidedly given to the listener 

broadcasting system. The reference numeral 2000 desig- 30 such as a radio voice but also in the situation of 

nates the speech speed conversion means; 321, a micro- conversation, a voice to be subjected to speech speed 

phone; 325, an earphone; 4003, an amplifier; and 4004, a conversion can be selected by the listener without any 

speaker. disturbance of listener's own speech. 

In the telephone in this embodiment, as shown in FIG. 37, Further, in hearing aids, foreign language learning 

the speech speed conversion means 200 according to the 35 machines, telephones, and so on, talker's voice can be heard 

present invention is inserted between the microphone 321, at a slow speech speed without any change of the charac- 

thc earphone 325 and the amplifier 4003 for the speaker teristic of the voice. 

4004 (2) Effective use of the memory, a raw voice repeat function, 

In the aforementioned structure, the listener can hear a a voice memory function, a repeat voice speech speed 

voice at a suitable speech speed even in the case where the 40 conversion function, a fast-hearing reproduction function, 

speaking person does not control the speech speed conver- and so on, can be provided. 

sion operation. For example, even in the case where the (3) Because means for changing the speech speed to a value 

speaking person talks volubly at a high speech speed selected by the speech speed selection switch is provided, 

(impetuous speed) selfishly, the listener can hear at a suitable the speech speed of a voice to be heard can be selected by 

speech speed. 45 the listener's own will. 

Further, the listener can hear at a suitable speech speed (4) Because means for repeating a reproduced voice in a 

from a speaker even in the case where a speaking person period in which the repeat switch is turned on is provided, 

speaks slowly. me speech speed of the repeat voice can be converted. 

As is obvious from the above explanation, the present (5) Because a catching-up means for catching up to a 

invention can be applied to technical fields requiring speech 50 position of stored information to be heard is provided in 

speed conversion, such as for example hearing aids, learning the speech speed conversion apparatus, widening of the 

of languages, abroad traveling, music, and so on, besides range of application of the speech speed conversion 

telephones, telephone line switching systems and premises apparatus, reduction in operating time, improvement in 

broadcasting. handling property, and so on, can be attained. 

For example, in goods for learning of languages and for 55 (6) Because at least one of the speech speed conversion 

abroad traveling, the present invention can be applied to the switch, the speech speed selection switch, the repeat 

following cases. switch and the reset switch is provided in a peripheral 

(1) A voice recorded is heard continuously and slowly. portion which is on a side surface of the speech speed 

(2) The expansion rate is changed in accordance with the conversion apparatus and easy to handle, widening of the 
improvement in level. 60 range of application of the speech speed conversion 

(3) A portion which was hard to understand when heard at apparatus, reduction in operating time, improvement in 
an ordinary speed is heard repeatedly and slowly. handling property, and so on, can be attained. 

(4) After heard slowly, a voice is heard again at its original (7) Efficiency in the speech speed conversion process can be 
speed. improved. 

(5) After slow repeat, pronounce is imitated. 65 (8) Because not only the determination of the waveform 

(6) An imitated voice is heard in comparison with its original expansionAeduction process and of the silent-part elimi- 
voice. nation process in the speech speed conversion process is 
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made on the basis of comparison between the power of a output means far outputting an output of said audio signal 

frame and the threshold but also the threshold is changed processing means as an audio signal 

in accordance with the loudness of the input voice, the 2. An apparatus according to claim 1, further comprising: 

speech speed conversion process can be carried out in means fox controlling at least an audio signal derived from 

accordance with the environmental condition in use. 5 an audio/video apparatus coupled to said audio signal 

(9) Because the microphone docs not catch click noise of storing apparatus. ^ * 
switches, the reproduced voice can be heard accurately. 3- An apparatus according to claim 1, wherein, when said 

(10) Because the switches have respective surface forms repeat mode is selected by said switch means, said audio 
which arc different in tactility so as to be identified fW* * readout from a poatom where die audio signal had 
without seeing, handling property can be improved 10 ^ J"** * ^ b ? a ^terrmned time and 

j , <T tT 6 c ^ a%- Jr A r outputted the read out audio signal. 

(11) Because means for preventing the rustle of clothes J An apparatus according to claim 1, wherein there are at 
against the microphone is provided, the entrance of noise ^ fipeeds of ^ output audio signal in said 

can be reduced. repeat mode among a speed same as the input speed, a slow 

(12) Because display means capable of visually indicating speed lower ^ ^ speedj a fesl ^4^^ man the 
the quantity of a time lag from me cmrenttirne is provided is input spccd ^ ^ a gradually accelerated speed. 

in a predetermined position of the speech speed conver- 5 apparatus according to claim 1, further comprising: 

sion apparatus, reduction in operating time, improvement a ^h^p m ofe for adjusting a quantity of a time lag 

in handling property, and so on, can be attained. from the input audio signal in real time so as to catch 

(13) Because a ring buffer is used as a memory means so that U p ^th the real time audio signal when the time lag is 
means far managing the lag rime by a counter indicating 20 caused by the repeat mode or the speech speed chang- 
the time lag on the ring buffer is provided, complex ing mode. 

calculation of pointer addresses in the repeat process, the 6. An apparatus according to claim 5, wherein said 

catching-up process, and so on, can be performed easily. catch-up mode starts during execution of said repeat mode 

(14) Because a standby mode and an analog through mode or said speech speed changing mode, and is changed to said 
are provided besides the through mode, reduction in 25 through mode when me output audio signal catches up to the 
consumed electric power can be attained. real time input audio signal. 

(15) Because the electric source switch is provided in the 7. An apparatus according to claim 6, wherein a message 
form of three stages consisting of an stage, an stage and signal is outputted when the output audio signal catches up 
an OFF intermediate stage so that the analog through to the real time input audio signal. 

mode is provided, not only reduction in electric power can 30 8. An apparatus according to claim 5, wherein each of said 

be attained but also the range of use of the electric source means forming said apparatus is disposed on a surface of a 

can be widened. body of said apparatus or at an inside location of said body, 

(16) Because the aforementioned speech speed conversion said apparatus has, as said switch means, a repeat switch for 
means is provided between the handset of a telephone and instructing said repeat mode, a speech speed change mode 
the apparatus body, a voice to be subjected to speech 35 switch for instructing said speech speed change mode, and 
speed conversion can be selected by the listener without a reset switch for cancelling speech speed change mode and 
any disturbance of the listener's own speech. transferring to said through mode, said switches being 

(17) A talker's voice over the telephone can be heard at a disposed at a easy-to-use place on a periphery of a side of 
slow speech speed without any change of the character- said body. 

istic of the voice, 40 9. An apparatus according to claim 8, wherein said audio 

(1 8) Because the speech speed conversion means is provided signal input means is disposed on a top portion of said body 
in a telephone line switching system, a voice to be so as to prevent said audio signal input means from directly 
subjected to speech speed conversion can be selected by touching clothes when said apparatus body is put into a 
the listener without any disturbance of the listener's own pocket while in use. 

speech. 45 10. An apparatus accordmg to claim 5, wherein, when said 

What is claimed is: reset switch is turned on during said catch-up mode, the 

1. An audio signal storing apparatus having a function for catch up mode is transferred to said through mode, 

converting speech speed, comprising: 11> An apparatus according to claim 5, wherein said 

audio signal in pu t means for taking i n an audio signal; catchup mode is started during execution of said repeat 

^ ™JI~TfL f^.w. M w 1 «5^n — * 50 mode or said speech speed changing mode, and is trans- 

a memory for storing said audio sigiST^ ferr ed to said through Tmode when the audio signal is 

an audio signal processing means for executing one of the rcproduccd by mc rcpcat mode indication time point 

following processing modes: u ^ apparatus according to claim 1, further compris- 

a through mode for storing the audio signal into said ^ 

memory while the audio signal is read out from said M mcaDS for hcaring ^ audio signal from said 0ll tp Ut 

memory without changing the audio signal speed, mcans through a binaural headphone. 

a repeat mode for reading out the audio signal stored in 13. An apparatus according to claim 1, wherein each of 

said memory in the past and for outputting the past sa id switches is formed with a switch which has a feeling of 

audio signal read out from said memory without chang- 50 ft touch so that said audio signal input means does not pick 

ing the audio signal speed, and 60 up a click noise of the switch. 

a speech speed changing mode for executing a speech 14. An apparatus according to claim 1, wherein each of 

speed changing process for the audio signal read out said means forming said apparatus are disposed on a surface 

from said memory; of a body of said apparatus or at an inside location of said 

switch means for causing said audio signal processing body, and said apparatus further comprises a rustling pre- 

means to execute as said speech speed changing pro- 65 vention means for preventing said audio signal input means 

cess one of said three processing modes of said audio from directly touching clothes when said apparatus body is 

signal processing means; and put into a pocket while in use. 
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15. An apparatus according to claim 1, wherein each of 
said means forming said apparatus is disposed on a surface 
of a body of said apparatus or at an inside location of said 
body, and said apparatus further comprises a display means 
which is provided at a predetermined position of said 5 
apparatus to thereby visually indicate a quantity of time lag 
from the real time produced by execution of said repeat 
mode or said speech speed changing mode. 

16. An apparatus according to claim 15, wherein a ring 
buffer is used as said memory means, and said apparatus 10 
further comprises means for managing a lag time by a 
counter which indicates a time lag on said ring counter, a 
result of said managing means being displayed by said 
display means. 

17. An apparatus according to claim 1, wherein said audio 15 
signal processing means further executes a standby mode, 
wherein a clock cycle for executing the process is decreased 
and the same process as in the through mode is executed. 

18. An apparatus according to claim 1, further compris- 
ing: 20 

an electric source switch of an apparatus body operated at 
three stages consisting of an ON stage, and OFF stage 
and an ON-OFF intermediate stage, and 

an electric power supply means operated in an analog 
through mode in which analog input-output systems are 25 
short-circuited so as to be directly connected to each 
other to thereby stop electric power supply to a digital 
processing system between said analog input-output 
systems when said electric source switch is in the ^ 
intermediate stage. 

19. A telephone line switching system having said speech 
speed changing means as defined in claim 1. 

20. An apparatus according to claim 1, wherein, in said 
through mode, the audio signal read out from said memory 
is outputtcd without changing not only the audio signal 
speed but also other characteristics of said audio signal 

21. An apparatus according to claim 1, wherein said audio 
signal processing means is controlled in such a manner that 
an intermittent sound is outputted until the audio signal 
stored in said memory in the past is read out from saidrepeat 
mode selected by said switch means. 

22. An apparatus according to claim 1, wherein each of 
said means forming said apparatus is disposed on a surface 
of a body of said apparatus or at an inside of said body, said 
apparatus has, as said switch means, a repeat switch for 
instructing said repeat mode, a speech speed change mode 
switch for instructing said speech speed change mode, and 
a reset switch for cancelling said speech speed change mode 
and for transferring to said through mode, said switches 
being disposed at a easy-to-use place on a periphery of a side 50 
of said body. 

23. An apparatus according to claim 22, wherein said 
switches have respective surface forms different in tactility 
so as to be identified without seeing. 

24. An audio signal storing apparatus having a function 
for converting speech speed, comprising: 

a microphone for converting an acoustic signal into an 
electric signal; 

a first analog amplifier for amplifying an output of said w 
microphone; 

a first low-pass filter for removing high-frequency com- 
ponents from an output of said analog amplifier; 

an A/D converter for converting an output of said analog 
amplifier into a digital signal; 63 

a memory means for storing input speech data and data 
obtained as a result of signal processing; 



a digital signal processor for reading out data from said 
memory means and for carrying out digital signal 
processing to execute a speech speed changing process 
for the acoustic signal in accordance with an external 
speech speed conversion instruction; 

means for controlling said speech speed changing process 
executed by said digital signal processor; 

means for changing a parameter of said speech speed 
changing process; 

selecting means for receiving said speech speed conver- 
sion instruction and causing said digital signal proces- 
sor to execute as said speech speed changing process 
one of the following processings: 

a first processing for storing input speech data into said 
memory means while the speech data are read out from 
said memory means, and for outputting the speech data 
read out from said memory means without changing the 
speech speed, 

a second processing for reading out the speech data stored 
in said memory means in the past and for outputting the 
past speech data read out from said memory means 
without changing the speech speed, and 

a third processing for executing said speech speed chang- 
ing process for the speech data read out from said 
memory means; 

a D/A converter for converting digital speech data out- 
putted from said digital signal processor into an analog 
speech signal; 

a second low-pass filter for removing high-frequency 
components from outputs of said D/A converter; 

a second analog amplifier for amplifying an output of said 
second low-pass filter, and 

a head-phone for converting an output of said second 
analog amplifier into an acoustic signal and for sup- 
prying the acoustic signal to ears of a user. 

25. An apparatus according to claim 24, wherein said third 
processing is provided as a software executed by said digital 
signal processor having an input terminal for receiving an 
interruption request signal from the outside, so that an 
instruction for selection of one mode among said first to 
third processings by said selecting means is provided to said 
digital signal processor via the input terminal receiving said 
interruption request 

26. An audio signal storing apparatus having a function 
for converting speech speed, comprising: 

audio signal input means for taking in an audio signal; 

a ring buffer for storing said audio signal; 

an audio signal processing means far executing one of the 
following processing modes: 

a through mode for storing audio signal into said ring 
buffer while the audio signal is read out from said ring 
buffer and for outputting the audio signal read our from 
said ring buffer without changing the audio signal 
speed, 

a repeat mode for reading out the audio signal stored in 
said ring buffer in the past and for outputting the past 
audio signal read our from said ring buffer without 
changing the audio signal speed, and 

a speech speed changing mode for executing a speech 
speed changing process for the audio signal read out 
from said ring buffer; 

wherein, in said speech speed changing mode, average 
power of said audio signal is calculated in each input 
frame unit, said speech speed changing process being 
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executed only in a case where said average power is 
higher than a predetermined threshold value, and the 
audio signal of frame unit being directly outputted in a 
case where said average power is lower than said 
predetermined threshold value, 
said speech speed changing mode being executed as a 
pipeline process by each frame unit with use of a 
plurality of input frame buffers in such a manner that 
for data of every frame a pitch extraction process is 
applied to a leading portion of the frame to detect a 
pitch of the leading portion, data of a length of one 
pitch thus detected is transferred to output buffers, data 
of a length of two pitches is multiplied by a window 
function which changes from 0 to 1 and by a window . 
function which changes from 1 to 0, respective data 15 
obtained by the multiplications by the window func- 
tions are added together to thereby generate a repro- 
duced wave pattern having a time length of two pitches, 
the reproduced wave pattern being inserted in the rear 
of the preliminary transferred data of the length of one 20 
pitch, a pitch detection process is again carried out 
while spearheaded by a position preliminary subjected 
to the pitch extraction process to thereby perform pitch 
detection at said position, and data of the length of n 
pitches (n is an integer) based on the pitch length 25 
obtained by the final pitch detection to the output 
buffers; 

a switch far causing said audio signal processing means to 
execute as said speech speed changing process one of ^ 
said processing modes of the audio signal processing 
means; and 

output means for outputting an output of said audio signal 
processing means as an audio signal. 

27. An apparatus according to claim 26, wherein said ring 35 
buffer stores the audio signal by each frame unit 

28. An apparatus according to claim 26, wherein deter- 
mining of waveform expansion/reduction processes in said 
speech speed changing process is made by referring to 
results of comparison between power of frame and a thresh- ^ 
old value which is externally adjustable. 

29. An apparatus according to claim 26, wherein a second 
threshold value is provided in the comparison process for 
comparing said average power with the predetermined 
threshold value so that when a frame having lower average 43 
power than said second threshold value is continued for a 
longer time than a predetermined time threshold, data in the 
frame having lower average power than the second threshold 
value and continued far a longer time than said time thresh- 
old are forbidden to be transferred to the output buffers. 

30. A speech speed conversion apparatus for receiving an 
input speech and for changing output time without changing 
a pitch of said input speech, comprising: 

a portable case having a speech input microphone dis- 
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smaller than a first threshold value and the period is 
longer than a predetermined time length, 
a memory for storing the compressed speech data in an 

order of time series, 
decompressing means f or reading out the compressed 
speech data of me predetermined length from said 
memory and for releasing said compressed speech data 
from the compression, 
speech speed converting means for outputting the 
decoded speech data in response to an instruction from 
said output control switch and for converting the speed 
of said speech data in such a manner that the decom- 
pressed speech data, in a period where the power of 
speech data is larger than a second threshold value, is 
rnaintained in constant pitch and the time axis is 
extended longer than the input time of said period; and 
output means far supplying an output from said speech 
speed converting means to said speech output terminal. 

31. An audio signal storing apparatus having a function 
for converting speech speed, comprising: 

audio signal input means for taking in an audio signal; 
a memory for digitizing said audio signal and storing the 

digitized audio signal in accordance with a writing 

pointer; 

an audio signal processing means for executing one of the 
following processing modes for the audio signal read 
out from a reading pointer position of said memory: 

a through mode for oufcutting a read-out audio signal 
from said memory in such a manner that a reading 
pointer position is same as a writing pointer position, 

a repeat mode for outputting the audio signal read out 
from said memory by setting the reading pointer posi- 
tion which is returned back by a predetermined time 
from the writing pointer, and 

a speech speed changing mode for outputting the audio 
signal by executing a speech speed changing process 
for the audio signal read out from said memory in such 
a manner that the reading pointer position is gradually 
delayed from the writing pointer, 

switch far causing said audio signal processing means to 
execute as said speech speed changing process one of 
said three processing modes of said audio signal pro- . 
cessing means; and 

output means for converting the output of said audio 
signal processing means to an analog signal and out- 
putting said analog signal as an audio signal. 

32. An apparatus according to claim 31, wherein the 
reading pointer position and the writing pointer position are 
same with each other at a start point of said speech speed 



changing mode. 

_ _ 33. An apparatus according to claim 31, wherein the 

posed on a surface of the case, an output control switch, 55 reading pointer position is delayed in time from the writing 

and a speech output terminal, said case being as small p OU1 ter position at a start point of said speech speed chang- 

as a palm; ing mode, 

said case includes: 34. An apparatus according to claim 31, wherein a time 

compressing means couoled to sai d input microphone for span between the reading pointer position and the writing 

' digitally compressing said input speech in each prede- 60 pointer position is constant in said repeat mode, 
termined length, said compressing of the input speech 

being deleting of data bit, wherein a speech power is ***** 



03/17/2004, EAST Version: 1.4.1 



